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Dísease and the genome. Studíes of the 
human genome are revealing disease origins and 
other biochemical mysteries. Human 
chromosomes, left, contain the DNA molecules 
that constitute the human genome. The staining 
pattern serves to identify specific regions of a 
chromosome. On the right is a diagram of 
human chromosome 7, with band q31.2 
indicated by an arrow. A gene in this region 
encodes a protein that, when maifunctioning, 
causes cystic fibrosis. [(Left) Alfred Pasieka/Peter 
Arnold.] 
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GACTTCACTTCTAATGATGATTATGGGAGAACTGGAGCCT 
TCAGAGGGTAAAAATTAAGCACAGTGGAAGAATTTCATTC 
TGTTCTCAGTTTTCCTGGATTATGCCTGGCACCATTAAAG 
AAAATATCTTTGGTGTTTCCTATGATGAATATAGATACAG 
AAGCGTCATCAAAGCATGCCAACTAGAAGAG.... This string of 
letters A, G, G, and T is a part of a DNA sequence. Since the biochemical 
techniques for DNA sequencing were first developed more than three 
decades ago, the genomes of dozens of organisms have been sequenced, and 
many more such sequences will be forthcoming. The information contained 
in these DNA sequences promises to shed light on many fascinating and 

important questions. What genes in Vibrio cholera, the bac- _ 

terium that causes cholera, for example, distinguish it from 
its more benign relatives? How is the development of com- 
plex organisms controlled? What are the evolutionary rela- 
tionships between organisms? 

Sequencing studies have led us to a tremendous land- 
mark in the history of biology and, indeed, humanity. A 
nearly complete sequence of the entire human genome has been 
determined. The string of As, Cs, Gs, and Ts with which 
we began this book is a tiny part of the human genome se¬ 
quence, which is more than 3 billion letters long. If we in- 
cluded the entire sequence, our opening sentence would fill 
more than 500,000 pages. 

The implications of this knowledge cannot be overesti- 
mated. By using this blueprint for much of what it means 
to be human, scientists can begin the identification and 
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characterization of sequences that foretell the appearance of specific dis- 
eases and particular physical attributes. One consequence will be the de- 
velopment of better means of diagnosing and treating diseases. Ultimately, 
physicians will be able to devise plans for preventing or managing heart dis- 
ease or câncer that take account of individual variations. Although the se- 
quencing of the human genome is an enormous step toward a complete un- 
derstanding of living systems, much work needs to be done. Where are the 
functional genes within the sequence, and how do they interact with one an- 
other? How is the information in genes converted into the functional char- 
acteristics of an organism? Some of our goals in the study of biochemistry 
are to learn the concepts, tools, and facts that will allow us to address these 
questions. It is indeed an exciting time, the beginning of a new era in bio¬ 
chemistry. 


1.1 DNA ILLUSTRATES THE RELATION BETWEEN 
FORM AND FUNCTION 

The structure of DNA, an abbreviation for deoxyribonucleic acid, illustrates 
a basic principie common to all biomolecules: the intimate relation between 
structure and function. The remarkable properties of this chemical sub- 
stance allow it to function as a very efficient and robust vehicle for storing 
information. We begin with an examination of the covalent structure of 
DNA and its extension into three dimensions. 

1.1.1 DNA Is Constructed from Four Building Blocks 

DNA is a linear polymer made up of four different monomers. It has a fixed 
backbone from which protrude variable substituents (Figure 1.1). The back¬ 
bone is built of repeating sugar-phosphate units. The sugars are molecules 
of deoxyribose from which DNA receives its name. Joined to each deoxyri- 
bose is one of four possible bases: adenine (A), cytosine (C), guanine (G), 
and thymine (T). 



Adenine (A) 


Cytosine (C) 


Guanine (G) 


Thymine (T) 


All four bases are planar but differ significantly in other respects. Thus, the 
monomers of DNA consist of a sugar-phosphate unit, with one of four bases 
attached to the sugar. These bases can be arranged in any order along a strand 
of DNA. The order of these bases is what is displayed in the sequence that 
begins this chapter. For example, the first base in the sequence shown is G 


FIGURE 1.1 Covalent structure of 
DNA. Each unit of the polymeric structure 
is composed of a sugar (deoxyribose), 
a phosphate, and a variable base that 
protrudes from the sugar-phosphate 
backbone. 
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DNA: Form and Function 


(guanine), the second is A (adenine), and so on. The sequence of bases along 
a DNA strand constitutes the genetic information —the instructions for as- 
sembling proteins, which themselves orchestrate the synthesis of a host of 
other biomolecules that form cells and ultimately organisms. 


1.1.2 Two Single Strands of DNA Combine 
to Form a Double Helix 

Most DNA molecules consist of not one but two strands 
(Figure 1.2). How are these strands positioned with re- 
spect to one another? In 1953, James Watson and Francis 
Crick deduced the arrangement of these strands and pro- 
posed a three-dimensional structure for DNA molecules. 

This structure is a double helix composed of two inter- 
twined strands arranged such that the sugar-phosphate 
backbone lies on the outside and the bases on the inside. The key to this 
structure is that the bases form specific base pairs (bp) held together by hyd- 
rogen bonds (Section 1.3.1): adenine pairs with thymine (A-T) and guanine 
pairs with cytosine (G—C), as shown in Figure 1.3. Hydrogen bonds are much 
weaker than covalent bonds such as the carbon—carbon or carbon—nitrogen 
bonds that define the structures of the bases themselves. Such weak bonds 
are crucial to biochemical systems; they are weak enough to be reversibly 
broken in biochemical processes, yet they are strong enough, when many 
form simultaneously, to help stabilize specific structures such as the dou¬ 
ble helix. 




FIGURE 1.2 The double helix. The 

double-helical structure of DNA proposed 
by Watson and Crick. The sugar-phosphate 
backbones of the two chains are shown in 
red and blue and the bases are shown in 
green, purple, orange, and yellow. 




Adenine (A) 


Thymine (T) 


Guanine (G) 


Cytosine (C) 


FIGURE 1.3 Watson-Críck base pairs. 

Adenine pairs with thymine (A-T), and 
guanine with cytosine (G-C). The dashed 
lines represent hydrogen bonds. 


The structure proposed by Watson and Crick has two properties of cen¬ 
tral importance to the role of DNA as the hereditary material. First, the 
structure is compatible with any sequence of bases. The base pairs have es- 
sentially the same shape (Figure 1.4) and thus fit equally well into the cen- 
ter of the double-helical structure. Second, because of base-pairing, the se¬ 
quence of bases along one strand completely determines the sequence along the 
other strand. As Watson and Crick so coyly wrote: 'Tt has not escaped our 
notice that the specific pairing we have postulated immediately suggests a 
possible copying mechanism for the genetic material.’' Thus, if the DNA 
double helix is separated into two single strands, each strand can act as a 
template for the generation of its partner strand through specific base-pair 
formation (Figure 1.5). The three-dimensional structure of DNA beautifully 
illustrates the dose connection between molecular form and function. 


FIGURE 1.4 Base-pairing in DNA. 

The base-pairs A-T (blue) and C-G (red) 
are shown overlaid. The Watson-Crick 
base-pairs have the same overall size and 
shape, allowing them to fit neatly within 
the double helix. 




FIGURE 1.5 DNA replication. If a DNA 

molecule is separated into two strands, 
each strand can act as the template for 
the generation of its partner strand. 
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1.1.3 RNA Is an Intermediate in the Flow of Genetic 
Information 

An important nucleic acid in addition to DNA is ribonucleic acid (RNA). 
Some viruses use RNA as the genetic material, and even those organisms 
that employ DNA must first convert the genetic information into RNA for 
the information to be accessible or functional. Structurally, RNA is quite 
similar to DNA. It is a linear polymer made up of a limited number of re- 
peating monomers, each composed of a sugar, a phosphate, and a base. The 
sugar is ribose instead of deoxyribose (hence, RNA) and one of the bases is 
uracil (U) instead of thymine (T). Unlike DNA, an RNA molecule usually 
exists as a single strand, although significant segments within an RNA mol¬ 
ecule may be double stranded, with G pairing primarily with C and A pair- 
ing with U. This intrastrand base-pairing generates RNA molecules with 
complex structures and activities, including catalysis. 

RNA has three basic roles in the cell. First, it serves as the intermediate 
in the flow of information from DNA to protein, the primary functional 
molecules of the cell. The DNA is copied, or transcribed, into messenger 
RNA (mRNA), and the mRNA is translated into protein. Second, RNA 
molecules serve as adaptors that translate the information in the nucleic acid 
sequence of mRNA into information designating the sequence of con- 
stituents that make up a protein. Finally, RNA molecules are important 
functional components of the molecular machinery, called ribosomes, that 
carries out the translation process. As will be discussed in Chapter 2, the 
unique position of RNA between the storage of genetic information in DNA 
and the functional expression of this information as protein as well as its 
potential to combine genetic and catalytic capabilities are indications that 
RNA played an important role in the evolution of life. 

1.1.4 Proteins, Encoded by Nucleic Acids, Perform Most Cell 
Functions 

A major role for many sequences of DNA is to encode the sequences oíprot¬ 
eins, the workhorses within cells, participating in essentially all processes. 
Some proteins are key structural components, whereas others are specific 
catalysts (termed enzymes) that promote chemical reactions. Like DNA and 
RNA, proteins are linear polymers. However, proteins are more compli- 
cated in that they are formed from a selection of 20 building blocks, called 
amino acids, rather than 4. 

The functional properties of proteins, like those of other biomolecules, 
are determined by their three-dimensional structures. Proteins possess an 
extremely important property: a protein spontaneously folds into a well- 
defined and elaborate three-dimensional structure that is dictated entirely 
by the sequence of amino acids along its chain (Figure 1.6). The self-folding 
nature of proteins constitutes the transition from the one-dimensional world of 
sequence information to the three-dimensional world ofbiologicalfunction. This 
marvelous ability of proteins to self assemble into complex structures is re- 
sponsible for their dominant role in biochemistry. 

How is the sequence of bases along DNA translated into a sequence of 
amino acids along a protein chain? We will consider the details of this process 
in later chapters, but the important finding is that three bases along a DNA 
chain encode a single amino acid. The specific correspondence between a set 
of three bases and 1 of the 20 amino acids is called the genetic code. Like the 
use of DNA as the genetic material, the genetic code is essentially univer¬ 
sal; the same sequences of three bases encode the same amino acids in all 
life forms from simple microorganisms to complex, multicellular organisms 
such as human beings. 




Unity Underlies Diversity 



Knowledge of the functional and structural properties of proteins is ab- 
solutely essential to understanding the significance of the human genome 
sequence. For example, the sequence at the beginning of this chapter cor- 
responds to a region of the genome that differs in people who have the ge- 
netic disorder cystic fibrosis. The most common mutation causing cystic fi- 
brosis, the loss of three consecutive Ts from the gene sequence, leads to the 
loss of a single amino acid within a protein chain of 1480 amino acids. This 
seemingly slight difference—a loss of 1 amino acid of nearly 1500—creates 
a life-threatening condition. What is the normal function of the protein en- 
coded by this gene? What properties of the encoded protein are compro- 
mised by this subtle defect? Can this knowledge be used to develop new 
treatments? These questions fali in the realm of biochemistry. Knowledge 
of the human genome sequence will greatly accelerate the pace at which con- 
nections are made between DNA sequences and disease as well as other hu¬ 
man characteristics. However, these connections will be nearly meaningless 
without the knowledge of biochemistry necessary to interpret and exploit 
them. 


1.2 BIOCHEMICAL UNITY UNDERLIES 
BIOLOGICAL DIVERSITY 

The stunning variety of living systems (Figure 1.7) belies a striking simi- 
larity. The common use of DNA and the genetic code by all organisms un¬ 
derlies one of the most powerful discoveries of the past century—namely, 
that organisms are remarkably uniform at the molecular levei. All organisms 
are built from similar molecular components distinguishable by relatively 
minor variations. This uniformity reveals that all organisms on Earth have 



FIGURE 1.6 Folding of a protein. The 

three-dimensional structure of a protein, a 
linear polymer of amino acids, is dictated 
by its amino acid sequence. 


Cystic fibrosis— 

A disease that resuits from a decrease 
in fluid and sait secretion by a trans- 
port protein referred to as the cystic 
fibrosis transmembrane conductance 
regulator (CFTR). As a result of this 
defect, secretion from the pancreas is 
blocked, and heavy, dehydrated mucus 
accumulates in the lungs, leading to 
chronic lung infections. 


FIGURE 1.7 The diversity of living 
Systems. The distinct morphologies of the 
three organisms shown—a plant (the false 
hellebora, or Indian poke) and two 
animais (sea urchins and a common 
house cat)-might suggest that they have 
littie in common. Yet biochemically they 
display a remarkable commonality that 
attests to a common ancestry. [(Left and 
right) John Dudak/Phototake. (Middie) Jeffrey L. 
Rotman/Peter Arnold.] 














arisen from a common ancestor. A core of essential bio- 
chemical processes, common to all organisms, appeared 
early in the evolution of life. The diversity of life in the 
modern world has been generated by evolutionary 
processes acting on these core processes through mil- 
lions or even billions of years. As we will see repeatedly, 
the generation of diversity has very often resulted from 
the adaptation of existing biochemical components to 
new roles rather than the development of fundamentally 
new biochemical technology. The striking uniformity of 
life at the molecular levei affords the student of bio- 
chemistry a particularly clear view into the essence of 
biological processes that applies to all organisms from 
human beings to the simplest microorganisms. 

On the basis of their biochemical characteristics, the 
diverse organisms of the modern world can be divided 
into three fundamental groups called domains: Eukarya 
(eukaryotes), Bactéria (formerly Eubacteria), and Arc- 
haea (formerly Archaebacteria). Eukarya comprise all 
macroscopic organisms, including human beings as well 
as many microscopic, unicellular organisms such as 
yeast. The defining characteristic of eukaryotes is the presence of a well-de- 
fined nucleus within each cell. Unicellular organisms such as bactéria, which 
lack a nucleus, are referred to as prokaryotes. The prokaryotes were reclas- 
sified as two separate domains in response to Cari Woese s discovery in 1977 
that certain bactéria-like organisms are biochemically quite distinct from 
better-characterized bacterial species. These organisms, now recognized as 
having diverged from bactéria early in evolution, are archaea. Evolutionary 
paths from a common ancestor to modern organisms can be developed and 
analyzed on the basis of biochemical Information. One such path is shown 
in Figure 1.8. 

By examining biochemistry in the context of the tree of life, we can of¬ 
ten understand how particular molecules or processes helped organisms 
adapt to specific environments or life styles. We can ask not only what bio¬ 
chemical processes take place, but also why particular strategies appeared in 
the course of evolution. In addition to being sources of historical insights, 
the answers to such questions are often highly instructive with regar d to the 
biochemistry of contemporary organisms. 
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FIGURE 1.8 The tree of life. A possible evolutionary path from a 
common ancestral cell to the diverse species present in the 
modern world can be deduced from DNA sequence analysis. 


1.3 CHEMICAL BONDS IN BIOCHEMISTRY 


The essence of biological processes—the basis of the uniformity of liv¬ 
ing Systems—is in its most fundamental sense molecular interactions; in 
other words, the chemistry that takes place between molecules. Bio¬ 
chemistry is the chemistry that takes place within living systems. To truly 
understand biochemistry, we need to understand chemical bonding. We 
review here the types of chemical bonds that are important for biochem- 
icals and their transformations. 

The strongest bonds that are present in biochemicals are covalent bonds, 
such as the bonds that hold the atoms together within the individual bases 
shown in Figure 1.3. A covalent bond is formed by the sharing of a pair of 
electrons between adjacent atoms. A typicai carbon-carbon (C—C) covalent 
bond has a bond length of 1.54 A and bond energy of 85 kcal moU^ 
(356 kj moU^). Because this energy is relatively high, considerable energy 
must be expended to break covalent bonds. More than one electron pair can 



be shared between two atoms to form a multiple covalent bond. For exam- 
ple, three of the bases in Figure 1.4 include carbon—oxygen (C=0) double 
bonds. These bonds are even stronger than C-C single bonds, with ener- 
gies near 175 kcal moP^ (732 kj moP^). 

For some molecules, more than one pattern of covalent bonding can be 
written. For example, benzene can be written in two equivalent ways called 
resonance structures. Benzene^s true structure is a composite of its two reso- 
nance structures. A molecule that can be written as several resonance struc¬ 
tures of approximately equal energies has greater stability than does a mol¬ 
ecule without multiple resonance structures. Thus, because of its resonance 
structures, benzene is unusually stable. 

Chemical reactions entail the breaking and forming of covalent bonds. 
The flow of electrons in the course of a reaction can be depicted by curved 
arrows, a method of representation called ‘‘arrow pushing.'' Each arrow rep- 
resents an electron pair. 
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Chemical Bonds 



Benzene resonance 
structures 


1.3.1 Reversible Interactions of Biomolecules Are Mediated by 
Three Kinds of Noncovalent Bonds 

Readily reversible, noncovalent molecular interactions are key steps in the 
dance of life. Such weak, noncovalent forces play essential roles in the faithful 
replication of DNA, the folding of proteins into intricate three-dimensional 
forms, the specific recognition of substrates by enzymes, and the detection 
of molecular signals. Indeed, all biological structures and processes depend 
on the interplay of noncovalent interactions as well as covalent ones. The 
three fundamental noncovalent bonds are electrostatic interactions, hydrogen 
bonds, and van der Waals interactions. They differ in geometry, strength, and 
specificity. Furthermore, these bonds are greatly affected in different ways by 
the presence of water. Let us consider the characteristics of each: 

1. Electrostatic interactions. An electrostatic interaction depends on the elec- 
tric charges on atoms. The energy of an electrostatic interaction is given by 
Coulomb 's lavo: 


E = kqiq2/Dr 

where E is the energy, qi and q 2 are the charges on the two atoms (in units 
of the electronic charge), r is the distance between the two atoms (in 
angstroms), D is the dielectric constant (which accounts for the effects of 
the intervening médium), and fe is a proportionality constant (fe = 332, to 
give energies in units of kilocalories per mole, or 1389, for energies in kilo- 
joules per mole). Thus, the electrostatic interaction between two atoms bear- 
ing single opposite charges separated by 3 À in water (which has a dielec¬ 
tric constant of 80) has an energy of 1.4 kcal moP^ (5.9 kJ moP^). 

2. Hydrogen bonds. Hydrogen bonds are relatively weak interactions, which 
nonetheless are crucial for biological macromolecules such as DNA and pro¬ 
teins. These interactions are also responsible for many of the properties of 
water that make it such a special solvent. The hydrogen atom in a hydro¬ 
gen bond is partly shared between two relatively electronegative atoms such 
as nitrogen or oxygen. The hydrogen-bond donor is the group that includes 
both the atom to which the hydrogen is more tightly linked and the hydrogen 
atom itself, whereas the hydrogen-bond acceptor is the atom less tightly linked 
to the hydrogen atom (Figure 1.9). Hydrogen bonds are fundamentally 


r 


Hydrogen- Hydrogen- 
bond donor bond acceptor 
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FIGURE 1.9 Hydrogen bonds that 
include nitrogen and oxygen atoms. 

The positions of the partial charges 
(ô^ and ô~) are shown. 
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FIGURE 1.10 Energy of a van der Waals 
ínteractíon as two atoms approach 
one another. The energy is most favorable 
at the van der Waals contact distance. The 
energy rises rapidiy owing to electron- 
electron repulsion as the atoms move 
closer together than this distance. 


electrostatic interactions. The relatively electronegative atom to which the 
hydrogen atom is covalently bonded pulls electron density away from the 
hydrogen atom so that it develops a partial positive charge (S^). Thus, it 
can interact with an atom having a partial negative charge (S~) through an 
electrostatic interaction. 

Hydrogen bonds are much weaker than covalent bonds. They have en- 
ergies of 1-3 kcal moP^ (4-13 kj moP^) compared with approximately 
100 kcal moP^ (418 kJ mol~^) for a carbon-hydrogen covalent bond. 
Hydrogen bonds are also somewhat longer than are covalent bonds; their bond 
distances (measured from the hydrogen atom) range from 1.5 to 2.6 A; 
hence, distances ranging from 2.4 to 3.5 À separate the two nonhydrogen 
atoms in a hydrogen bond. The strongest hydrogen bonds have a tendency 
to be approximately straight, such that the hydrogen-bond donor, the hy¬ 
drogen atom, and the hydrogen-bond acceptor lie along a straight line. 

3. van der Waals interactions. The basis of a van der Waals interaction is 
that the distribution of electronic charge around an atom changes with time. 
At any instant, the charge distribution is not perfectly symmetric. This tran- 
sient asymmetry in the electronic charge around an atom acts through elec¬ 
trostatic interactions to induce a complementary asymmetry in the electron 
distribution around its neighboring atoms. The resulting attraction between 
two atoms increases as they come closer to each other, until they are sepa- 
rated by the van der Waals contact distance (Figure 1.10). At a shorter dis¬ 
tance, very strong repulsive forces become dominant because the outer elec¬ 
tron clouds overlap. 

Energies associated with van der Waals interactions are quite small; typ- 
ical interactions contribute from 0.5 to 1.0 kcal moP^ (from 2 to 4 kJ moP^) 
per atom pair. When the surfaces of two large molecules come together, 
however, a large number of atoms are in van der Waals contact, and the net 
effect, summed over many atom pairs, can be substantial. 





FIGURE 1.11 Structure of ice. Hydrogen bonds (shown as 
dashed lines) are formed between water molecules. 


1.3.2 The Properties of Water Affect the Bonding Abilities 
of Biomolecules 

Weak interactions are the key means by which molecules interact with one 
another—enzymes with their substrates, hormones with their receptors, an- 
tibodies with their antigens. The strength and specificity of weak interac¬ 
tions are highly dependent on the médium in which they take place, and the 
majority of biological interactions take place in water. Two 
properties of water are especially important biologically: 

1. Water is a polar molecule. The water molecule is bent, not 
linear, and so the distribution of charge is asymmetric. The 
oxygen nucleus draws electrons away from the hydrogen nu- 
clei, which leaves the region around the hydrogen nuclei with 
a net positive charge. The water molecule is thus an electri- 
cally polar structure. 

2. Water is highly cohesive. Water molecules interact strongly 
with one another through hydrogen bonds. These interac¬ 
tions are apparent in the structure of ice (Figure 1.11). Net¬ 
works of hydrogen bonds hold the structure together; simi¬ 
lar interactions link molecules in liquid water and account for 
the cohesion of liquid water, although, in the liquid State, 
some of the hydrogen bonds are broken. The highly cohesive 
nature of water dramatically affects the interactions between 
molecules in aqueous solution. 






What is the effect of the properties of water on the weak interactions dis- 
cussed in Section 1.3.1? The polarity and hydrogen-bonding capability of 
water make it a highly interacting molecule. Water is an excellent solvent 
for polar molecules. The reason is that water greatly weakens electrostatic 
forces and hydrogen bonding between polar molecules by competing for 
their attractions. For example, consider the effect of water on hydrogen 
bonding between a carbonyl group and the NH group of an amide. 



A hydrogen atom of water can replace the amide hydrogen atom as a 
hydrogen-bond donor, whereas the oxygen atom of water can replace the 
carbonyl oxygen atom as a hydrogen-bond acceptor. Hence, a strong hy¬ 
drogen bond between a CO group and an NH group forms only if water is 
excluded. 

The dielectric constant of water is 80, so water diminishes the strength 
of electrostatic attractions by a factor of 80 compared with the strength of 
those same interactions in a vacuum. The dielectric constant of water is un- 
usually high because of its polarity and capacity to form oriented solvent 
shells around ions. These oriented solvent shells produce electric fields of 
their own, which oppose the fields produced by the ions. Consequently, the 
presence of water markedly weakens electrostatic interactions between ions. 

The existence of life on Earth depends critically on the capacity of wa¬ 
ter to dissolve a remarkable array of polar molecules that serve as fuels, 
building blocks, catalysts, and Information carriers. High concentrations of 
these polar molecules can coexist in water, where they are free to diffuse and 
interact with one another. However, the excellence of water as a solvent 
poses a problem, because it also weakens interactions between polar mole¬ 
cules. The presence of water-free microenvironments within biological Sys¬ 
tems largely circumvents this problem. We will see many examples of these 
specially constructed niches in protein molecules. Moreover, the presence 
of water with its polar nature permits another kind of weak interaction to 
take place, one that drives the folding of proteins (Section 1.3.4) and the 
formation of cell boundaries (Section 12.3). 

The essence of these interactions, like that of all interactions in bio- 
chemistry, is energy. To understand much of biochemistry—bond forma¬ 
tion, molecular structure, enzyme catalysis—we need to understand energy. 
Thermodynamics pr o vides a valuable tool for approaching this topic. We 
will revisit this topic in more detail when we consider enzymes (Chapter 8) 
and the basic concepts of metabolism (Chapter 14). 

1.3.3 Entropy and the Laws of Thermodynamics 

The highly structured, organized nature of living organisms is apparent 
and astonishing. This organization extends from the organismal through 
the cellular to the molecular levei. Indeed, biological processes can seem 
magicai in that the well-ordered structures and patterns emerge from the 
chaotic and disordered world of inanimate objects. However, the organi¬ 
zation visible in a cell or a molecule arises from biological events that are 
subject to the same physical laws that govern all processes—in particular, 
the laws of thermodynamics. 
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How can we understand the creation of order out of chãos? We begin by 
noting that the laws of thermodynamics make a distinction between a system 
and its surroundings. A system is defined as the matter within a defined region 
of space. The matter in the rest of the universe is called the surroundings. The 
First Lavo of Thermodynamics States that the total energy of a system and its sur¬ 
roundings is constant. In other words, the energy content of the universe is con- 
stant; energy can be neither created nor destroyed. Energy can take different 
forms, however. Heat, for example, is one form of energy. Heat is a manifes- 
tation of the kinetic energy associated with the random motion of molecules. 
Alternatively, energy can be present as potential energy, referring to the ability 
of energy to be released on the occurrence of some process. Consider, for ex¬ 
ample, a bali held at the top of a tower. The bali has considerable potential en¬ 
ergy because, when it is released, the bali will develop kinetic energy associ¬ 
ated with its motion as it falis. Within chemical systems, potential energy is 
related to the likelihood that atoms can react with one another. For instance, a 
mixture of gasoline and oxygen has much potential energy because these mol¬ 
ecules may react to form carbon dioxide and release energy as heat. The First 
Law requires that any energy released in the formation of chemical bonds be 
used to break other bonds, be released as heat, or be stored in some other form. 

Another important thermodynamic concept is that of entropy. Entropy is 
a measure of the levei of randomness or disorder in a system. The Second Law 
of Thermodynamics States that the total entropy of a system and its surround¬ 
ings always increasesfor a spontaneous process. At first glance, this law appears 
to contradict much common experience, particularly about biological Sys¬ 
tems. Many biological processes, such as the generation of a well-defined 
structure such as a leaf from carbon dioxide gas and other nutrients, clearly 
increase the levei of order and hence decrease entropy. Entropy may be de- 
creased locally in the formation of such ordered structures only if the entropy 
of other parts of the universe is increased by an equal or greater amount. 

An example may help clarify the application of the laws of thermody¬ 
namics to a chemical system. Consider a Container with 2 moles of hydro- 
gen gas on one side of a divider and 1 mole of oxygen gas on the other (Figure 
1.12). If the divider is removed, the gases will intermingle spontaneously to 
form a uniform mixture. The process of mixing increases entropy as an or¬ 
dered arrangement is replaced by a randomly distributed mixture. 

Other processes within this system can decrease the entropy locally while 
increasing the entropy of the universe. A spark applied to the mixture initiates 
a chemical reaction in which hydrogen and oxygen combine to form water: 

2 H2 + O2- > 2 H2O 

If the temperature of the system is held constant, the entropy of the system 
decreases because 3 moles of two differing reactants have been combined to 
form 2 moles of a single product. The gas now consists of a uniform set of 
indistinguishable molecules. However, the reaction releases a significant 
amount of heat into the surroundings, and this heat will increase the en¬ 
tropy of the surrounding molecules by increasing their random movement. 
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FIGURE 1.12 From order to disorder. The spontaneous mixing of gases is driven by an 
increase in entropy. 
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FIGURE 1.13 Entropy changes. When hydrogen and oxygen combine to 
form water, the entropy of the system is reduced, but the entropy of the 
universe is increased owing to the release of heat to the surroundings. 


Heat 


The entropy increase in the surroundings is enough to allow water to form 
spontaneously from hydrogen and oxygen (Figure 1.13). 

The change in the entropy of the surroundings will be proportional to 
the amount of heat transferred from the system and inversely proportional 
to the temperature of the surroundings, because an input of heat leads to a 
greater increase in entropy at lower temperatures than at higher tempera- 
tures. In biological systems, T [in kelvin (K), absolute temperature] is as- 
sumed to be constant. If we define the heat content of a system as enthalpy 
(H), then we can express the relation linking the entropy fSJ of the sur¬ 
roundings to the transferred heat and temperature as a simple equation: 

surroundings ^ffsystem/f 

The total entropy change is given by the expression 
ASI = ASI + ASI 

total System ' ^‘^surroundings 

Substituting equation 1 into equation 2 yields 

^'^total ^'^system ^-^^system''^'^ 

Multiplying by — T gives 

'^^'^total ^-f^system system 

The function — TAS has units of energy and is referred to as/ree energy or 
Gibbs free energy, after Josiah Willard Gibbs, who developed this function 
in 1878: 


( 1 ) 

( 2 ) 

(3) 

(4) 


AG = Afígystem - 'TASgystem (5) 

The free-energy change, AG, will be used throughout this book to describe 
the energetics of biochemical reactions. 

Recall that the Second Law of Thermodynamics States that, for a reac- 
tion to be spontaneous, the entropy of the universe must increase. Exami- 
nation of equation 3 shows that the total entropy will increase if and only if 

system ^ ff system /T (6) 

Rearranging gives TASgystem ^ or entropy will increase if and only if 

— Afígystem “ f^Sgystem ^0 (2) 

In other words, the free-energy change must be negative for a reaction to be 
spontaneous. A negative free-energy change occurs with an increase in the 
overall entropy of the universe. Thus, we need to consider only one term, 
the free energy of the system, to decide whether a reaction can occur spon¬ 
taneously; any effects of the changes within the system on the rest of the 
universe are automatically taken into account. 
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FIGURE 1.T4 Protein folding. Protein 
folding entails the transition from a 
disordered mixture of unfolded molecules 
to a relatively uniform solution of folded 
protein molecules. 


FIGURE 1.15 The hydrophobic effect. 

The aggregation of nonpolar groups in 
water leads to an increase in entropy 
owing to the release of water molecules 
into bulk water. 


1.3.4 Protein Folding Can Be Understood in Terms 
of Free-Energy Changes 

The problem of protein folding illustrates the utility of the concept of free 
energy. Consider a system consisting of a solution of unfolded protein mol¬ 
ecules in aqueous solution (Figure 1.14). Each unfolded protein molecule 
can adopt a unique conformation, so the system is quite disordered and the 
entropy of the collection of molecules is relatively high. Yet, protein folding 
proceeds spontaneously under appropriate conditions. Thus, entropy must 
be increasing elsewhere in the system or in the surroundings. How can we 
reconcile the apparent contradiction that proteins spontaneously assume an 
ordered structure, and yet entropy increases? The entropy decrease in the 
system on folding is not as large as it appears to be, because of the proper- 
ties of water. Molecules in aqueous solution interact with water molecules 
through the formation of hydrogen and ionic interactions. However, some 
molecules (termed nonpolar molecules) cannot participate in hydrogen or ionic 
interactions. The interactions of nonpolar molecules with water are not as fa- 
vorable as are interactions between the water molecules themselves. The wa¬ 
ter molecules in contact with these nonpolar surfaces form ‘‘cages'' around 
the nonpolar molecule, becoming more well ordered (and, hence, lower in 
entropy) than water molecules free in solution. As two such nonpolar mol¬ 
ecules come together, some of the water molecules are released, and so they 
can interact freely with bulk water (Figure 1.15). Hence, nonpolar molecules 
have a tendency to aggregate in water because the entropy of the water is in- 
creased through the release of water molecules. This phenomenon, termed 
the hydrophobic effect, helps promote many biochemical processes. 

How does the hydrophobic effect favor protein folding? Some of the 
amino acids that make up proteins have nonpolar groups. These nonpolar 
amino acids have a strong tendency to associate with one another inside the 
interior of the folded protein. The increased entropy of water resulting from 
the interaction of these hydrophobic amino acids helps to compensate for 
the entropy losses inherent in the folding process. 

Hydrophobic interactions are not the only means of stabilizing protein 
structure. Many weak bonds, including hydrogen bonds and van der Waals 
interactions, are formed in the protein-folding process, and heat is released 
into the surroundings as a consequence. Although these interactions replace 
interactions with water that take place in the unfolded protein, the net re- 
sult is the release of heat to the surroundings and thus a negative (favorable) 
change in enthalpy for the system. 

The folding process can occur when the combination of the entropy as- 
sociated with the hydrophobic effect and the enthalpy change associated 
with hydrogen bonds and van der Waals interactions makes the overall free 
energy negative. 
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Biochemistry and Human Biology 


Our understanding of biochemistry has had and will continue to have ex- 
tensive effects on many aspects of human endeavor. First, biochemistry is an 
intrinsically beautiful and fascinating body of knowledge. We now know the 
essence and many of the details of the most fundamental processes in bio¬ 
chemistry, such as how a single molecule of DNA replicates to generate two 
identical copies of itself and how the sequence of bases in a DNA molecule 
determines the sequence of amino acids in an encoded protein. Our ability 
to describe these processes in detailed, mechanistic terms places a firm Chem¬ 
ical foundation under other biological Sciences. Moreover, the realization that 
we can understand essential life processes, such as the transmission of hered- 
itary information, as chemical structures and their reactions has significant 
philosophical implications. What does it mean, biochemically, to be human? 
What are the biochemical differences between a human being, a chimpanzee, 
a mouse, and a fruit fly? Are we more similar than we are different? 

Second, biochemistry is greatly influencing medicine and other fields. The 
molecular lesions causing sickle-cell anemia, cystic fibrosis, hemophilia, and 
many other genetic diseases have been elucidated at the biochemical levei. 
Some of the molecular events that contribute to câncer development have 
been identified. An understanding of the underlying defects opens the door 
to the discovery of effective therapies. Biochemistry makes possible the ra- 
tional design of new drugs, including specific inhibitors of enzymes required 
for the replication of viruses such as human immunodeficiency virus (HIV). 
Genetically engineered bactéria or other organisms can be used as ' Tacto- 
ries'' to produce valuable proteins such as insulin and stimulators of blood- 
cell development. Biochemistry is also contributing richly to clinicai diag- 
nostics. For example, elevated leveis of telltale enzymes in the blood reveal 
whether a patient has recently had a myocardial infarction (heart attack). 
DNA probes are coming into play in the precise diagnosis of inherited dis- 
orders, infectious diseases, and cancers. Agriculture, too, is benefiting from 
advances in biochemistry with the development of more effective, environ- 
mentally safer herbicides and pesticides and the creation of genetically en¬ 
gineered plants that are, for example, more resistant to insects. All of these 
endeavors are being accelerated by the advances in genomic sequencing. 

Third, advances in biochemistry are enabling researchers to tackle some of 
the most exciting questions in biology and medicine. How does a fertilized egg 
give rise to cells as different as those in muscle, brain, and liver? How do 
the senses work? What are the molecular bases for mental disorders such as 
Alzheimer disease and schizophrenia? How does the immune system dis- 
tinguish between self and nonself? What are the molecular mechanisms of 
short-term and long-term memory? The answers to such questions, which 
once seemed remote, have been partly uncovered and are likely to be more 
thoroughly revealed in the near future. 

Because all living organisms on Earth are linked by a common origin, 
evolution provides a powerful organizing theme for biochemistry. This book 
is organized to emphasize the unifying principies revealed by evolutionary 
considerations. We begin in the next chapter with a brief tour along a plau- 
sible evolutionary path from the formation of some of the Chemicals that we 
now associate with living organisms through the evolution of the processes 
essential for the development of complex, multicellular organisms. The re- 
mainder of Part I of the book more fully introduces the most important 
classes of biochemicals as well as catalysis and regulation. Part II, Trans- 
ducing and Storing Energy, describes how energy from Chemicals or from 
sunlight is converted into usable forms and how this conversion is regulated. 
As we will see, a small set of molecules such as adenosine triphosphate 
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(ATP) act as energy currencies that allow energy, however captured, to be 
utilized in a variety of biochemical processes. This part of the text exam¬ 
ines the important pathways for the conversion of environmental energy 
into molecules such as ATP and uncovers many unifying principies. Part 
III, Synthesizing the Molecules of Life, illustrates the use of the molecules 
discussed in Part II to synthesize key molecular building blocks, such as the 
bases of DNA and amino acids, and then shows how these precursors are 
assembled into DNA, RNA, and proteins. In Parts II and III, we will high- 
light the relation between the reactions within each pathway and between 
those in different pathways so as to suggest how these individual reactions 
may have combined early in evolutionary history to produce the necessary 
molecules. From the student s perspective, the existence of features com- 
mon to several pathways enables material mastered in one context to be 
readily applied to new contexts. Part IV, Responding to Environmental 
Changes, explores some of the mechanisms that cells and multicellular or- 
ganisms have evolved to detect and respond to changes in the environment. 
The topics range from general mechanisms, common to all organisms, for 
regulating the expression of genes to the sensory systems used by human 
beings and other complex organisms. In many cases, we can now see how 
these elaborate systems evolved from pathways that existed earlier in evo¬ 
lutionary history. Many of the sections in Part IV link biochemistry with 
other fields such as cell biology, immunology, and neuroscience. We are now 
ready to begin our journey into biochemistry with events that took place 
more than 3 billion years ago. 


APPENDIX; DEPICTING MOLECULAR STRUCTURES 


The authors of a biochemistry text face the problem of trying to 
present three-dimensional molecules in the two dimensions 
available on the printed page. The interplay between the three- 
dimensional structures of biomolecules and their biological 
functions will be discussed extensively throughout this book. 
Toward this end, we will frequently use representations that, al- 
though of necessity are rendered in two dimensions, emphasize 
the three-dimensional structures of molecules. 


Stereochemical Renderings 

Mo st of the Chemical formulas in this text are drawn to depict 
the geometric arrangement of atoms, crucial to chemical bond- 
ing and reactivity, as accurately as possible. For example, the car- 
bon atom of methane is sp^ hybridized and tetrahedral, with 


H-C-H angles of 109.5 degrees while the carbon atom in 
formaldehyde is sp^ hybridized with bond angles of 120 degrees. 





Methane Formaldehyde 


To illustrate the correct stereochemistry about carbon atoms, 
wedges will be used to depict the direction of a bond into or out of 
the plane of the page. A solid wedge with the broad end away from 
the carbon denotes a bond coming toward the viewer out of the 
plane. A dashed wedge, with the broad end of the bond at the car¬ 
bon represents a bond going away from the viewer into the plane 
of the page. The remaining two bonds are depicted as straight lines. 



FIGURE 1.16 Molecular representations. Comparison of (A) space-filling, (B) ball-and- 
stick, and (C) skeletal models of ATP. 






Fischer Projections 

Although more representative of the actual structure of a com- 
pound, stereochemical structures are often difficult to draw 
quickly. An alternative method of depicting structures with 
tetrahedral carbon centers relies on the use of Fischer projections. 


W W 



Fischer Stereochemical 

projection rendering 


In a Fischer projection, the bonds to the central carbon are 
represented by horizontal and vertical lines from the sub- 
stituent atoms to the carbon atom, which is assumed to be at 
the center of the cross. By convention, the horizontal bonds are 
assumed to project out of the page toward the viewer, whereas 
the vertical bonds are assumed to project into the page away 
from the viewer. Appendix xx at the back of the book is a struc- 
tural glossary of the key molecules in biochemistry, presented 
both as stereochemically accurate structures and as Fisher 
projections. 

For depicting molecular architecture in more detail, five 
types of models will be used: space filling, bali and stick, skele- 
tal, ribbon, and surface representations (Figure 1.16). The first 
three types show structures at the atomic levei. 

1. Space-fiUing models. The space-filling models are the most 
realistic. The size and position of an atom in a space-filling model 
are determined by its bonding properties and van der Waals ra- 
dius, or contact distance (Section 1.3.1). A van der Waals radius 
describes how closely two atoms can approach each other when 
they are not linked by a covalent bond. The colors of the model 
are set by convention. 

Carbon, black Hydrogen, white Nitrogen, blue 

Oxygen, red Sulfur, yellow Phosphorus, purple 

Space-filling models of several simple molecules are shown in 
Figure 1.17. 
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2. Bali -and- stick models. Bali -and -stick models are not as re¬ 
alistic as space-filling models, because the atoms are depicted 
as spheres of radii smaller than their van der Waals radii. How- 
ever, the bonding arrangement is easier to see because the 
bonds are explicitly represented as sticks. In an illustration, the 
taper of a stick, representing parallax, tells which of a pair of 
bonded atoms is closer to the reader. A bali -and -stick model 
reveals a complex structure more clearly than a space-filling 
model does. 

3. Skeletal models. An even simpler image is achieved with a skele- 
tal model, which shows only the molecular framework. In skeletal 
models, atoms are not shown explicitly. Rather, their positions are 
implied by the junctions and ends of bonds. Skeletal models are 
frequently used to depict larger, more complex structures. 

As biochemistry has advanced, more attention has been fo- 
cused on the structures of biological macromolecules and their 
complexes. These structures comprise thousands or even tens of 
thousands of atoms. Although these structures can be depicted 
at the atomic levei, it is difficult to discern the relevant structural 
features because of the large number of atoms. Thus, more 
schematic representations—ribbon diagrams and surface repre¬ 
sentations—have been developed for the depiction of macro- 
molecular structures in which atoms are not shown explicitly 
(Figure 1.18). 

4. Ribbon diagrams. These diagrams are highly schematic and 
most commonly used to accent a few dramatic aspects of protein 
structure, such as the a helix (a coiled ribbon), the fS strand (a 
broad arrow), and loops (simple lines), so as to provide simple 
and clear views of the folding patterns of proteins. 

5. Surface representations. Often, the interactions between 
macromolecules take place exclusively at their surfaces. Surface 
representations have been developed to better visualize macro- 
molecular surfaces. These representations display the overall 
shapes of macromolecules and can be shaded or colored to indi- 
cate particular features such as surface topography or the distri- 
bution of electric charges. 



FIGURE 1.17 Space-filling models. Structural formulas and space-filling representations 
of selected molecules are shown. 
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FIGURE 1.18 Alternatíve representations of proteín structure. A ribbon 
diagram (A) and a surface representation (B) of a key protein from the immune system 
emphasize different aspects of structure. 
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deoxyribonucleic acid (DNA) (p. 4) 

double helix (p. 5) 

ribonucleic acid (RNA) (p. 6) 

protein (p. 6) 

amino acid (p. 6) 

genetic code (p. 6) 

Eukarya (p. 8) 

Bactéria (p. 8) 

Archaea (p. 8) 


eukaryote (p. 8) 

prokaryote (p. 8) 

covalent bond (p. 8) 

resonance structure (p. 9) 

electrostatic interaction (p. 9) 

hydrogen bond (p. 9) 

van der Waals interaction (p. 9) 

entropy (p. 12) 

enthalpy (p. 13) 


free energy (p. 13) 
hydrophobic effect (p. 14) 
sterochemistry (p. 16) 
Fischer projection (p. 17) 
space-filling model (p. 17) 
ball-and stick-model (p. 17) 
skeletal model (p. 17) 
ribbon diagram (p. 17) 
surface presentation (p. 17) 




Biochemical Evolution 



Natural selection, one of the key forces poweríng evolution, opens an array of 
ímprobable ecologícal níches to specíes that can adapt biochemícally. (Left) Salt 
pools, where the salt concentration can be greater than 1.5 M, would seem to be highly 
inhospitable environments for life. Yet certain halophilic archaea, such as Haloferax 
mediteranei (right), possess biochemical adaptations that enable them to thrive under these 
harsh conditions. [(Left) Kaj R. Svensson/Science Photo Library/Photo Researchers; (right) Wanner/Eye 
of Science/Photo Researchers.] 


OUTLINE 


Earth is approximately 4.5 billion years old. Remarkably, there is convinc- 
ing fóssil evidence that organisms morphologically (and very probably bio- 
chemically) resembling certain modern bactéria were in existence 3.5 bil¬ 
lion years ago. With the use of the results of directed studies and accidental 
discoveries, it is now possible to construct a hypothetical yet plausible evo- 
lutionary path from the prebiotic world to the present. A number of uncer- 
tainties remain, particularly with regar d to the earliest events. Nonetheless, 
a consideration of the steps along this path and the biochemical problems 
that had to be solved provides a useful perspective from which to regard the 
processes found in modern organisms. These evolutionary connections make 

many aspects of biochemistry easier to under st and. _ 

We can think of the path leading to modern living 
species as consisting of stages, although it is important to 
keep in mind that these stages were almost certainly not as 
distinct as presented here. The first stage was the initial gen- 
eration of some of the key molecules of life—nucleic acids, 
proteins, carbohydrates, and lipids—by nonbiological 
processes. The second stage was fundamental—the transi- 
tion from prebiotic chemistry to replicating systems. With 
the passage of time, these systems became increasingly so- 
phisticated, enabling the formation of living cells. In the 
third stage, mechanisms evolved for interconverting energy 
from Chemical sources and sunlight into forms that can be 
utilized to drive biochemical reactions. Intertwined with 
these energy -conversion processes are pathways for syn- 
thesizing the components of nucleic acids, proteins, and 


2.1 Key Organic Molecules Are Used 
by Living Systems 

2.2 Evolution Requires Reproduction, 
Variation, and Selective Pressure 

2.3 Energy Transformations Are 
Necessary to Sustain Living Systems 

2.4 Cells Can Respond to Changes 
in Their Environments 
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FIGURE 2.1 The Urey-Míller experíment. 

An electric discharge (simulating lightning) 
passed through an atmosphere of CH4, NH3, 
H 2 O, and H 2 leads to the generation of key 
organic compounds such as amino acids. 


other key substances from simpler molecules. With the development of 
energy-conversion processes and biosynthetic pathways, a wide variety of 
unicellular organisms evolved. The fourth stage was the evolution of mech- 
anisms that allowed cells to adjust their biochemistry to different, and of- 
ten changing, environments. Organisms with these capabilities could form 
colonies comprising groups of interacting cells, and some eventually evolved 
into complex multicellular organisms. 

This chapter introduces key challenges posed in the evolution of life, 
whose Solutions are elaborated in later chapters. Exploring a possible evo- 
lutionary origin for these fundamental processes makes their use, in con- 
trast with that of potential alternatives, more understandable. 


2.1 KEY ORGANIC MOLECULES ARE USED 
BY LIVING SYSTEMS 


Approximately 1 billion years after Earth s formation, life appeared, as al- 
ready mentioned. Before life could exist, though, another major process 
needed to have taken place—the synthesis of the organic molecules required 
for living Systems from simpler molecules found in the environment. The 
components of nucleic acids and proteins are relatively complex organic 
molecules, and one might expect that only sophisticated synthetic routes 
could produce them. However, this requirement appears not to have been 
the case. How did the building blocks of life come to be? 

2.1.1 Many Components of Biochemical Macromolecules 
Can Be Produced in Simple, Prebiotic Reactions 

Among several competing theories about the conditions of the prebiotic 
world, none is completely satisfactory or problem-free. One theory holds 
that Earth s early atmosphere was highly reduced, rich in methane (CH 4 ), 
ammonia (NH 3 ), water (H 2 O), and hydrogen (H 2 ), and that this atmo¬ 
sphere was subjected to large amounts of solar radiation and lightning. 
For the sake of argument, we will assume that these conditions were indeed 
those of prebiotic Earth. Can complex organic molecules be synthesized 
under these conditions? In the 1950s, Stanley Miller and Harold Urey set 
out to answer this question. An electric discharge, simulating lightning, 
was passed through a mixture of methane, ammonia, water, and hydrogen 
(Figure 2.1). Remarkably, these experiments yielded a highly nonrandom 
mixture of organic compounds, including amino acids and other substances 
fundamental to biochemistry. The procedure produces the amino acids 
glycine and alanine in approximately 2 % yield, depending on the amount of 
carbon supplied as methane. More complex amino acids such as glutamic 
acid and leucine are produced in smaller amounts (Figure 2.2). Hydrogen 
cyanide (HCN), another likely component of the early atmosphere, will con- 
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FIGURE 2.2 Products of prebiotic synthesis. Amino acids produced in the Urey-Miller 
experiment. 
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FIGURE 2.3 Prebiotíc synthesís of a 
nucleíc acíd component. Adenine can 
be generated by the condensation of HCN. 


dense on exposure to heat or light to produce adenine, one of the four nu- 
cleic acid bases (Figure 2.3). Other simple molecules combine to form the 
remaining bases. A wide array of sugars, including ribose, can be formed 
from formaldehyde under prebiotic conditions. 

2.1.2 Uncertainties Obscure the Origins of Some 
Key Biomolecules 

The preceding observations suggest that many of the building blocks found 
in biology are unusually easy to synthesize and that significant amounts 
could have accumulated through the action of nonbiological processes. 
However, it is important to keep in mind that there are many uncertainties. 
For instance, ribose is just one of many sugars formed under prebiotic con¬ 
ditions. In addition, ribose is rather unstable under possible prebiotic con¬ 
ditions. Futhermore, ribose occurs in two mirror-image forms, only one of 
which occurs in modern RN A. To circumvent those problems, the first nu- 
cleic acid-like molecules have been suggested to have been bases attached 
to a different backbone and only later in evolutionary time was ribose in- 
corporated to form nucleic acids as we know them today. Despite these un¬ 
certainties, an assortment of prebiotic molecules did arise in some fashion, 
and from this assortment those with properties favorable for the processes that 
we now associate with life began to interact and to form more complicated com- 
pounds. The processes through which modern organisms synthesize molec¬ 
ular building blocks will be discussed in Chapters 24, 25, and 26. 


2.2 EVOLUTION REQUIRES REPRODUCTION, VARIATION, 
AND SELECTIVE PRESSURE 

Once the necessary building blocks were available, how did a living system 
arise and evolve? Before the appearance of life, simple molecular systems 
must have existed that subsequently evolved into the complex chemical Sys¬ 
tems that are characteristic of organisms. To address how this evolution oc- 
curred, we need to consider the process of evolution. There are several basic 
principies common to evolving systems, whether they are simple collections 
of molecules or competing populations of organisms. First, the most funda¬ 
mental property of evolving systems is their ability to replicate or reproduce. 
Without this ability of reproduction, each ‘‘species’' of molecule that might 
appear is doomed to extinction as soon as all its individual molecules de¬ 
grade. For example, individual molecules of biological polymers such as ri- 
bonucleic acid are degraded by hydrolysis reactions and other processes. 
However, molecules that can replicate will continue to be represented in the 
population even if the lifetime of each individual molecule remains short. 








Relative population 
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perfectly, the replicated molecule will always be the same as the parent mol- 
ecule. Evolution cannot occur. The nature of these variations in living Sys¬ 
tems are considered in Section 2.2.5. 

A third basic principie of evolution is competition. Replicating molecules 
compete with one another for available resources such as Chemical precur- 
sors, and the competition allows the process of evolution by natural selection 
to occur. Variation will produce differing populations of molecules. Some 
variant offspring may, by chance, be better suited for survival and replica- 
tion under the prevailing conditions than are their parent molecules. The 
prevailing conditions exert a selective pressure that gives an advantage to one 
of the variants. Those molecules that are best able to survive and to repli- 
cate themselves will increase in relative concentration. Thus, new molecules 
arise that are better able to replicate under the conditions of their environ- 
ment. The same principies hold true for modern organisms. Organisms re- 
produce, show variation among individual organisms, and compete for re¬ 
sources; those variants with a selective advantage will reproduce more 
successfully. The changes leading to variation still take place at the molec¬ 
ular levei, but the selective advantage is manifest at the organismal levei. 


2.2.1 The Principies of Evolution Can Be Demonstrated in Vitro 

Is there any evidence that evolution can take place at the molecular levei? 
In 1967, Sol Spiegelman showed that replicating molecules could evolve new 
forms in an experiment that allowed him to observe molecular evolution in 
the test tube. He used as his evolving molecules RNA molecules derived 
from a bacterial virus called bacteriophage Qj8. The genome of bacterio- 
phage Q8, a single RNA strand of approximately 3300 bases, depends for 
its replication on the activity of a protein complex termed Q8 replicase. 
Spiegelman mixed the replicase with a starting population of Q8 RNA mol¬ 
ecules. Under conditions in which there are ample amounts of precursors, 
no time constraints, and no other selective pressures, the composition of the 
population does not change from that of the parent molecules on replication. 
When selective pressures are applied, however, the composition of the pop¬ 
ulation of molecules can change dramatically. For example, decreasing the 
time available for replication from 20 minutes to 5 minutes yielded, incre- 
mentally over 75 generations, a population of molecules dominated by a sin¬ 
gle species comprising only 550 bases. This species is replicated 15 times as 
rapidly as the parental Q8 RNA (Figure 2.4). Spiegelman applied other se¬ 
lective pressures by, for example, limiting the concentra- 
tions of precursors or adding compounds that inhibit the 
replication process. In each case, new species appeared that 
replicated more effectively under the conditions imposed. 

The process of evolution demonstrated in these stud- 
ies depended on the existence of machinery for the repli¬ 
cation of RNA fragments in the form of the Q8 replicase. 
As noted in Chapter 1, one of the most elegant charac- 
teristics of nucleic acids is that the mechanism for their 
replication follows naturally from their molecular struc- 
ture. This observation suggests that nucleic acids, per- 
haps RNA, could have become self-replicating. Indeed, 
the results of studies have revealed that single-stranded 
nucleic acids can serve as templates for the synthesis of 
their complementary strands and that this synthesis can 
occur spontaneously—that is, without biologically de- 



FIGURE 2.4 Evolution in a test tube. Rapidly replicating 
species of RNA molecules were generated from QjS RNA by 
exerting selective pressure. The green and blue curves correspond 
to species of intermediate size that accumulated and then 
became extinct in the course of the experiment. 







rived replication machinery. However, investigators have not yet found con- 
ditions in which an RN A molecule is fully capable of independent self- 
replication from simple starting materiais. 
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2.2.2 RNA Molecules Can Act As Catalysts 

The development of capabilities beyond simple replication required the 
generation of specific catalysts. A catalyst is a molecule that accelerates a 
particular chemical reaction without itself being chemically altered in the 
process. The properties of catalysts will be discussed in detail in Chapters 
8 and 9. Some catalysts are highly specific; they promote certain reactions 
without substantially affecting closely related processes. Such catalysts al- 
low the reactions of specific pathways to take place in preference to those 
of potential alternative pathways. Until the 1980s, all biological catalysts, 
termed enzymes, were believed to be proteins. Then, Tom Cech and Sidney 
Altman independently discovered that certain RNA molecules can be ef- 
fective catalysts. These RNA catalysts have come to be known as ribozymes. 
The discovery of ribozymes suggested the possibility that catalytic RNA 
molecules could have played fundamental roles early in the evolution of life. 

The catalytic ability of RNA molecules is related to their ability to adopt 
specific yet complex structures. This principie is illustrated by a ‘Tam- 
merhead’' ribozyme, an RNA structure first identified in plant viruses 
(Figure 2.5). This RNA molecule promotes the cleavage of specific RNA 
molecules at specific sites; this cleavage is necessary for certain aspects of 
the virai life cycle. The ribozyme, which requires mg^"^ ion or other ions for 
the cleavage step to take place, forms a complex with its substrate RNA 
molecule that can adopt a reactive conformation. 

The existence of RNA molecules that possess specific binding and 
catalytic properties makes plausible the idea of an early ''RNA world” 
inhabited by life forms dependent on RNA molecules to play all major roles, 
including those important in heredity, the storage of information, and the 
promotion of specific reactions—that is, biosynthesis and energy metabolism. 
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(A) Ribozyme 




FIGURE 2.5 Catalytic RNA. 

(A) The base-pairing pattern of a 
"hammerhead" ribozyme and its substrate. 

(B) The folded conformation of the complex. 
The ribozyme cleaves the bond at the 
cleavage site. The paths of the nucleic acid 
backbones are highlighted in red and blue. 



2.2.3 Amino Acids and Their Polymers Can Play Biosynthetic 
and Catalytic Roles 

In the early RNA world, the increasing populations of replicating RNA mol¬ 
ecules would have consumed the building blocks of RNA that had been gen- 
erated o ver long periods of time by prebiotic reactions. A shortage of these 
compounds would have favored the evolution of alternative mechanisms 
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FIGURE 2.6 Biosynthesis of RNA bases. 

Amino adds are building blocks for the 
biosynthesis of purines and pyrimidines. 
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FIGURE 2.7 An alternatíve functíonal 
polymer. Proteins are built of amino adds 
linked by peptide bonds. 
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for their synthesis. A large number of pathways are possible. Examining the 
biosynthetic routes utilized by modern organisms can be a source of insight 
into which pathways survived. A striking observation is that simple amino 
acids are used as building blocks for the RNA bases (Figure 2.6). For both 
purines (adenine and guanine) and pyrimidines (uracil and cytosine), an 
amino acid serves as a core onto which the remainder of the base is elabo- 
rated. In addition, nitrogen atoms are donated by the amino group of the 
amino acid aspartic acid and by the amide group of the glutamine side chain. 

Amino acids are chemically more versatile than nucleic acids because 
their side chains carry a wider range of chemical functionality. Thus, amino 
acids or short polymers of amino acids linked by peptide bonds, called 
polypeptides (Figure 2.7), may have functioned as components of ribozymes 
to provide a specific reactivity. Furthermore, longer polypeptides are capa- 
ble of spontaneously folding to form well-defined three-dimensional struc- 
tures, dictated by the sequence of amino acids along their polypeptide 
chains. The ability of polypeptides to fold spontaneously into elaborate 
structures, which permit highly specific chemical interactions with other 
molecules, may have favored the expansion of their roles in the course of 
evolution and is crucial to their dominant position in modern organisms. 
Today, most biological catalysts (enzymes) are not nucleic acids but are in- 
stead large polypeptides called proteins. 

2.2.4 RNA Template-Directed Polypeptide Synthesis Links the 
RNA and Protein Worlds 

Polypeptides would have played only a limited role early in the evolution 
of life because their structures are not suited to self-replication in the way 
that nucleic acid structures are. However, polypeptides could have been in- 
cluded in evolutionary processes indirectly. For example, if the properties 
of a particular polypeptide favored the survival and replication of a class of 
RNA molecules, then these RNA molecules could have evolved ribozyme 
activities that promoted the synthesis of that polypeptide. This method of 
producing polypeptides with specific amino acid sequences has several lim- 
itations. First, it seems likely that only relatively short specific polypeptides 
could have been produced in this manner. Second, it would have been dif- 
ficult to accurately link the particular amino acids in the polypeptide in a 
reproducible manner. Finally, a different ribozyme would have been re- 
quired for each polypeptide. A criticai point in evolution was reached when 
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FIGURE 2.8 Linkíng the RNA and proteín worlds. Polypeptide synthesis is dírected by 
an RNA template. Adaptor RNA molecules, with amino adds attached, sequentially bind to 
the template RNA to facilitate the formation of a peptide bond between two amino adds. 
The growing polypeptide chain remains attached to an adaptor RNA until the completion 
of synthesis. 


an apparatus for polypeptide synthesis developed that allowed the sequence 
of bases in an RNA molecule to directly dictate the sequence of amino adds in 
a polypeptide. A code evolved that established a relation between a specific 
sequence of three bases in RNA and an amino acid. We now call this set of 
three-base combinations, each encoding an amino acid, the genetic code. A 
decoding, or translation, system exists today as the ribosome and associated 
factors that are responsible for essentially all polypeptide synthesis from 
RNA templates in modern organisms. The essence of this mode of polypep¬ 
tide synthesis is illustrated in Figure 2.8. 

An RNA molecule {messenger RNA, or mRNA), containing in its base 
sequence the information that specifies a particular protein, acts as a tem¬ 
plate to direct the synthesis of the polypeptide. Each amino acid is brought 
to the template attached to an adapter molecule specific to that amino acid. 
These adapter s are specialized RNA molecules (called transfer RN As or 
tRNAs). After initiation of the polypeptide chain, a tRNA molecule with 
its associated amino acid binds to the template through specific Watson- 
Crick base-pairing interactions. Two such molecules bind to the ribosome 
and peptide-bond formation is catalyzed by an RNA component (called 
ribosomal RNA or rRNA) of the ribosome. The first RNA departs (with 
neither the polypeptide chain nor an amino acid attached) and another 
tRNA with its associated amino acid bonds to the ribosome. The growing 
polypeptide chain is transferred to this newly bound amino acid with the 
formation of a new peptide bond. This cycle then repeats itself. This scheme 
allows the sequence of the RNA template to encode the sequence of the 
polypeptide and thereby makes possible the production of long polypep- 
tides with specified sequences. The mechanism of protein synthesis will be 
discussed in Chapter 29. Importantly, the ribosome is composed largely of 
RNA and is a highly sophisticated ribozyme, suggesting that it might be a 
surviving relic of the RNA world. 

2.2.5 The Genetic Code Elucidates the Mechanisms of Evolution 

The sequence of bases that encodes a functional protein molecule is called 
a gene. The genetic code—that is, the relation between the base sequence of 
a gene and the amino acid sequence of the polypeptide whose synthesis the 
gene directs—applies to all modern organisms with only very minor excep- 
tions. This universality reveals that the genetic code was fixed early in the 
course of evolution and has been maintained to the present day. 

We can now examine the mechanisms of evolution. Earlier, we consid- 
ered how variation is required for evolution. We can now see that such vari- 
ations in living systems are changes that alter the meaning of the genetic 
message. These variations are called mutations. A mutation can be as sim- 
ple as a change in a single nucleotide (called a point mutation), such that a 
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FIGURE 2.9 Mechanísms of evolution. 

A change in a gene can be (A) as simple 
as a single base change or (B) as dramatic 
as partial or complete gene duplication. 
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sequence of bases that encoded a particular amino acid may now encode 
another (Figure 2.9A). A mutation can also be the insertion or deletion of 
several nucleotides. 

Other types of alteration permit the more rapid evolution of new bio¬ 
chemical activities. For instance, entire sections of the coding material can 
be duplicated, a process called gene duplication (Figure 2.9B). One of the 
duplication products may accumulate mutations and eventually evolve into 
a gene with a different, but related, function. Furthermore, parts of a gene 
may be duplicated and added to parts of another to give rise to a completely 
new gene, which encodes a protein with properties associated with each par- 
ent gene. Higher organisms contain many large families of enzymes and 
other macromolecules that are clearly related to one another in the same 
manner. Thus, gene duplication foliowed by specialization has been a 
crucial process in evolution. It alio ws the generation of macromolecules 
having particular functions without the need to start from scratch. The 
accumulation of genes with subtle and large differences allows for the gen¬ 
eration of more complex biochemical processes and pathways and thus 
more complex organisms. 


3' 



FIGURE 2.10 Cloverleaf pattern of 
tRNA. The pattern of base-pairing 
interactions obsen/ed for all transfer RNA 
molecules reveals that these molecules 
had a common evolutionary origin. 


2.2.6 Transfer RNAs lllustrate Evolution by Gene Duplication 

Transfer RNA molecules are the adaptors that associate an amino acid with 
its correct base sequence. Transfer RNA molecules are structurally similar 
to one another: each adopts a three-dimensional cloverleaf pattern of base- 
paired groups (Figure 2.10). Subtle differences in structure enable the 
protein-synthesis machinery to distinguish transfer RNA molecules with 
different amino acid specificities. 

This family of related RNA molecules likely was generated by gene du¬ 
plication followed by specialization. A nucleic acid sequence encoding one 
member of the family was duplicated, and the two copies evolved inde- 
pendently to generate molecules with specificities for different amino acids. 
This process was repeated, starting from one primordial transfer RNA gene 
until the 20 (or more) distinct members of the transfer RNA family pres- 
ent in modern organisms arose. 

2.2.7 DNA Is a Stable Storage Form for Genetic Information 

It is plausible that RNA was utilized to store genetic information early in 
the history of life. However, in modern organisms (with the exception of 
some viruses), the RNA derivative DNA (deoxyribonucleic acid) performs 
this function (Sections 1.1.1 and 1.1.3). The 2'-hydroxyl group in the 







































Requirements of Evolution 
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FIGURE 2.11 RNA and DNA compared. 

Removal of the 2'-hydroxyl group from 
RNA to form DNA resuits in a backbone 
that is less susceptible to cleavage by 
hydrolysis and thus enables more-stable 
storage of genetic Information. 


ribose unit of the RNA backbone is replaced by a hydrogen atom in DNA 
(Figure 2.11). 

What is the selective advantage of DNA over RNA as the genetic ma¬ 
terial? The genetic material must be extremely stable so that sequence In¬ 
formation can be passed on from generation to generation without degra- 
dation. RNA itself is a remarkably stable molecule; negative charges in the 
sugar-phosphate backbone protect it from attack by hydroxide ions that 
would lead to hydrolytic cleavage. However, the 2'-hydroxyl group makes 
the RNA susceptible to base-catalyzed hydrolysis. The removal of the 2'- 
hydroxyl group from the ribose decreases the rate of hydrolysis by approx- 
imately 100-fold under neutral conditions and perhaps even more under ex¬ 
treme conditions. Thus, the conversion of the genetic material from RNA 
into DNA would have substantially increased its chemical stability. 

The evolutionary transition from RNA to DNA is recapitulated in the 
biosynthesis of DNA in modern organisms. In all cases, the building blocks 
used in the synthesis of DNA are synthesized from the corresponding build¬ 
ing blocks of RNA by the action of enzymes termed ribonucleotide reduc- 
tases. These enzymes convert ribonucleotides (a base and phosphate groups 
linked to a ribose sugar) into deoxyribonucleotides (a base and phosphates 
linked to deoxyribose sugar). 



The properties of the ribonucleotide reductases vary substantially from species 
to species, but evidence suggests that they have a common mechanism of ac¬ 
tion and appear to have evolved from a common primordial enzyme. 

The covalent structures of RNA and DNA differ in one other way. Whereas 
RNA contains uracil, DNA contains a methylated uracil derivative termed 
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FIGURE 2.12 ATP, the energy currency 
of living Systems. The phosphodiester 
bonds (red) release considerable energy 
when cleaved by hydrolysis or other 
processes. 


thymine. This modification also serves to protect the integrity of the genetic se- 
quence, although it does so in a less direct manner. As we will see in Chapter 
27, the methyl group present in thymine facilitates the repair of damaged 
DNA, providing an additional selective advantage. 

Although DNA replaced RN A in the role of storing the genetic infor- 
mation, RN A maintained many of its other functions. RN A still pro vides 
the template that directs polypeptide synthesis, the adaptor molecules, 
the catalytic activity of the ribosomes, and other functions. Thus, the genetic 
message is transcribed from DNA into RNA and then translated into protein. 
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This flow of sequence information from DNA to RNA to protein (to be 
considered in detail in Chapters 5, 28, and 29) applies to all modern or- 
ganisms (with minor exceptions for certain viruses). 


2.3 ENERGY TRANSFORMATIONS ARE NECESSARY TO 
SUSTAIN LIVING SYSTEMS 

Most of the reactions that lead to the biosynthesis of nucleic acids and other 
biomolecules are not thermodynamically favorable under most conditions; 
they require an input of energy to proceed. Thus, they can proceed only 
if they are coupled to processes that release energy. How can energy- 
requiring and energy-releasing reactions be linked? How is energy from the 
environment transformed into a form that living systems can use? Answer- 
ing these questions fundamental to biochemistry is the objective of much 
of this book. 

2.3.1 ATP, a Common Currency for Biochemical Energy, Can Be 
Generated Through the Breakdown of Organic Molecules 

Just as most economies simplify trade by using currency rather than bar- 
tering, biochemical systems have evolved common currencies for the ex- 
change of energy. The most important of these currencies are molecules re- 
lated to adenosine triphosphate (ATP) that contain an array of three linked 
phosphates (Figure 2.12). The bonds linking the phosphates persist in so- 
lution under a variety of conditions, but, when they are broken, an unusu- 
ally large amount of energy is released that can be used to promote other 
processes. The roles of ATP and its use in driving other processes will be 
presented in detail in Chapter 14 and within many other chapters through- 
out this book. 



(ATP) 













FIGURE 2.13 A possible early 
method for generatíng ATP. The 

synthesis of ATP might have been 
driven by the degradation of glycine. 
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ATP must be generated in appropriate quantities to be available for such 
reactions. The energy necessary for the synthesis of ATP can be obtained 
by the breakdown of other Chemicals. Specific enzymes have evolved to cou- 
ple these degradative processes to the phosphorylation of adenosine diphos- 
phate (ADP) to yield ATP. Amino acids such as glycine, which were prob- 
ably present in relatively large quantities in the prebiotic world and early in 
evolution, were likely sources of energy for ATP generation. The degrada¬ 
tion of glycine to acetic acid may be an ATP-generation system that func- 
tioned early in evolution (Figure 2.13). In this reaction, the carbon- 
nitrogen bond in glycine is cleaved by reduction (the addition of electrons), 
and the energy released from the cleavage of this bond drives the coupling 
of ADP and orthophosphate (Pi) to produce ATP. 

Amino acids are still broken down to produce ATP in modern organ- 
isms. However, sugars such as glucose are a more commonly utilized en¬ 
ergy source because they are more readily metabolized and can be stored. 

The most important process for the direct synthesis of ATP in modern or- 
ganisms is glycolysis, a complex process that derives energy from glucose. 

O 

II 4 

2 .C. + 2 ATP^" + 2 H 2 O + 4 e” + 4 

XOO” 

Pyruvate 

Glycolysis presumably evolved as a process for ATP generation after car- 
bohydrates such as glucose were being produced in significant quantities by 
other pathways. Glycolysis will be discussed in detail in Chapter 16. 

2.3.2 Cells Were Formed by the Inclusion of Nucleic Acids 
Within Membranes 

Modern organisms are made up of cells. A cell is composed of nucleic acids, pro- 
teins, and other biochemicals surrounded by a membrane built from lipids. These 
membranes completely enclose their contents, and so cells have a defined 
inside and outside. A typical membrane-forming lipid is phosphatidyl choline. 












Hydrophobic 


Hydrophilic 


CHAPTER 2 • Biochemical Evolution 


FIGURE 2.14 Schematic view of a lípid 
bílayer. These structures define the 
boundaries of cells. 


Osmosis— 

The movement of a solvent across a 
membrane in the direction that tends 
to equalize concentrations of solute on 
the two sides of the membrane. 
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The most important feature of membrane-forming molecules such as 
phosphatidyl choline is that they are amphipathic —that is, they contain both 
hydrophilic (water-loving) and hydrophobic (water-avoiding) components. 
Membrane-forming molecules consist of fatty acids, whose long alkyl 
groups are hydrophobic, connected to shorter hydrophilic ‘Tead groups.'' 
When such lipids are in contact with water, they spontaneously aggregate 
to form specific structures such that the hydrophobic parts of the molecules 
are packed together away from water, whereas the hydrophilic parts are ex- 
posed to the aqueous solution. The structure that is important for mem¬ 
brane formation is the lipid bilayer (Figure 2.14). A bilayer is formed from 
two layers of lipids arranged such that the fatty acid tails of each layer in- 
teract with each other to form a hydrophobic interior while the hydrophilic 
head groups interact with the aqueous solution on each side. Such bilayer 
structures can fold onto themselves to form hollow spheres having interior 
compartments filled with water. The hydrophobic interior of the bilayer 
serves as a barrier between two aqueous phases. If such structures are 
formed in the presence of other molecules such as nucleic acids and pro- 
teins, these molecules can become trapped inside, thus forming cell-like 
structures. The structures of lipids and lipid bilayers will be considered in 
detail in Chapter 12. 

At some stage in evolution, sufficient quantities of appropriate amphi¬ 
pathic molecules must have accumulated from biosynthetic or other 
processes to allow some nucleic acids to become entrapped and cell-like or- 
ganisms to form. Such compartmentalization has many advantages. When 
the components of a cell are enclosed in a membrane, the products of en- 
zymatic reactions do not simply diffuse away into the environment but in- 
stead are contained where they can be used by the cell that produced them. 
The containment is aided by the fact that nearly all biosynthetic interme- 
diates and other biochemicals include one or more charged groups such as 
phosphates or carboxylates. Unlike more nonpolar or neutral molecules, 
charged molecules do not readily pass through lipid membranes. 

2.3.3 Compartmentalization Required the Development 
of lon Pumps 

Despite its many advantages, the enclosure of nucleic acids and proteins 
within membranes introduced several complications. Perhaps the most sig- 
nificant were the effects of osmosis. Membranes are somewhat permeable to 
water and small nonpolar molecules, whereas they are impermeable to 
macromolecules such as nucleic acids. When macromolecules are concen- 
trated inside a compartment surrounded by such a semipermeable mem¬ 
brane, osmotic forces drive water through the membrane into the compart¬ 
ment. Without counterbalancing effects, the flow of water will burst the cell 
(Figure 2.15). 
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FIGURE 2.15 The "osmotíc crísís." 

A cell consisting of macromolecules 
surrounded by a semipermeable membrane 
will take up water from outside the cell 
and burst 


Modern cells have two distinct mechanisms for resisting these osmotic 
forces. One mechanism is to toughen the cell membrane by the introduc- 
tion of an additional structure such as a cell wall. However, such a chemi- 
cally elaborate structure may not have evolved quickly, especially because 
it must completely surround a cell to be effective. The other mechanism is 
the use of energy-dependent ion pumps. These pumps can lower the concen- 
tration of ions inside a cell relative to the outside, favoring the flow of wa¬ 
ter molecules from inside to outside. The resulting unequal distribution of 
ions across an inherently impermeable membrane is called an ion gradient. 
Appropriate ion gradients can balance the osmotic forces and maintain a 
cell at a constant volume. Membrane proteins such as ion pumps will be 
considered in Chapter 13. 

Ion gradients can prevent osmotic crises, but they require energy to be 
produced. Most likely, an ATP-driven proton pump was the first existing 
component of the machinery for generating an ion gradient (Figure 2.16). 
Such pumps, which are found in essentially all modern cells, hydrolyze ATP 
to ADP and inorganic phosphate and utilize the energy released to transport 
protons from the inside to the outside of a cell. The pump thus establishes a 
proton gradient that, in turn, can be coupled to other membrane-transport 
processes such as the removal of sodium ions from the cell. The proton gra¬ 
dient and other ion gradients generated from it act together to counteract os¬ 
motic effects and prevent the cell from swelling and bursting. 

2.3.4 Proton Gradients Can Be Used to Drive the 
Synthesis of ATP 

Enzymes act to accelerate reactions, but they cannot alter the position of 
Chemical equilibria. An enzyme that accelerates a reaction in the forward 
direction must also accelerate the reaction to the same extent in the reverse 
direction. Thus, the existence of an enzyme that utilized the hydrolysis of 
ATP to generate a proton gradient presented a tremendous opportunity for 
the evolution of alternative systems for generating ATP. Such an enzyme 
could synthesize ATP by reversing the process that produces the gradient. 
Enzymes, now called ATP synthases, do in fact use proton gradients to drive 
the bonding of ADP and Pi to form ATP (Figure 2.17). These proteins will 
be considered in detail in Chapter 18. 

Organisms have evolved a number of elaborate mechanisms for the gen- 
eration of proton gradients across membranes. An example is photosynthe- 
sis, a process first used by bactéria and now also used by plants to harness 
the light energy from the sun. The essence of photosynthesis is the light- 
driven transfer of an electron across a membrane. The fundamental 
processes are illustrated in Figure 2.18. 



FIGURE 2.16 Generating an ion 
gradient. ATP hydrolysis can be used to 
drive the pumping of protons (or other 
ions) across a membrane. 
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FIGURE 2.17 Use of proton gradients 
to synthesize ATP. ATP can be 

synthesized by the action of an ATP-driven 
proton pump running in reverse. 
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FIGURE 2.18 Photosynthesís. 

Absorption of light (1) leads to electron 
transfer across a membrane (2). For each 
electron transfer, one excess hydroxide ion 
is generated inside the cell (3). The process 
produces a proton gradient across the 
membrane that can drive ATP synthesis. 




The photosynthetic apparatus, which is embedded in a membrane, con- 
tains pigments that efficiently absorb light from the sun. The absorbed light 
provides the energy to promote an electron in the pigment molecule to an 
excited State. The high-energy electron can then jump to an appropriate ac- 
ceptor molecule located in the part of the membrane facing the inside of the 
cell. The acceptor molecule, now reduced, binds a proton from a water mol¬ 
ecule, generating an hydroxide ion inside the cell. The electronic ‘‘hole’' left 
in the pigment on the outside of the membrane can then be filled by the do- 
nation of an electron from a suitable reductant on the outside of the mem¬ 
brane. Because the generation of an hydroxide ion inside the cell is equiva- 
lent to the generation of a proton outside the cell, a proton gradient develops 
across the membrane. Protons flow down this gradient through ATP syn- 
thases to generate ATP. 

Photosynthesis is but one of a range of processes in different organisms 
that lead to ATP synthesis through the action of proteins evolutionarily re- 
lated to the primordial ATP-driven pumps. In animais, the degradation of 
carbohydrates and other organic compounds is the source of the electron 
flow across membranes that can be used to develop proton gradients. The 
formation of ATP-generating proton gradients by fuel metabolism will be 
considered in Chapter 18 and by light absorption in Chapter 19. 

2.3.5 Molecular Oxygen, a Toxic By-Product of Some 
Photosynthetic Processes, Can Be Utilized for Metabolic Purposes 

As stated earlier, photosynthesis generates electronic ‘‘holes'' in the photo¬ 
synthetic apparatus on the outside of the membrane. These holes are pow- 
erful oxidizing agents; that is, they have very high affinities for electrons 
and can pull electrons from many types of molecules. They can even oxi- 
dize water. Thus, for many photosynthetic organisms, the electron donor 
that completes the photosynthetic cycle is water. The product of water ox- 
idation is oxygen gas—that is, molecular oxygen (O 2 ). 

2 H 2 O- > O 2 + 4 e +4 

The use of water as the electron donor significantly increases the efficiency 
of photosynthetic ATP synthesis because the generation of one molecule of 
oxygen is accompanied not only by the release of four electrons (e~), but 












































also by the release of four protons on one side of the membrane. Thus, an 
additional proton is released for each proton equivalent produced by the ini- 
tial electron-transfer process, so twice as many protons are available to drive 
ATP synthesis. Oxygen generation will be considered in Chapter 19. 

Oxygen was present in only small amounts in the atmosphere before or- 
ganisms evolved that could oxidize water. The ‘‘pollution'' of the air with 
oxygen produced by photosynthetic organisms greatly affected the course 
of evolution. Oxygen is quite reactive and thus extremely toxic to many or¬ 
ganisms. Many biochemical processes have evolved to protect cells from the 
deleterious effects of oxygen and other reactive species that can be gener- 
ated from this molecule. Subsequently, organisms evolved mechanisms for 
taking advantage of the high reactivity of oxygen to promote favorable 
processes. Most important among these mechanisms are those for the oxi- 
dation of organic compounds such as glucose. Through the action of oxy¬ 
gen, a glucose molecule can be completely converted into carbon dioxide 
and water, releasing enough energy to synthesize approximately 30 mole- 
cules of ATP. 

Glucose + 6 O 2 -> 6 CO 2 + 6 H 2 O + energy 

This number represents a 15-fold increase in ATP yield compared with the 
yield from the breakdown of glucose in the absence of oxygen in the process 
of glycolysis. This increased efficiency is apparent in everyday life; our mus- 
cles exhaust their fuel supply and tire quickly if they do not receive enough 
oxygen and are forced to use glycolysis as the sole ATP source. The role of 
oxygen in the extraction of energy from organic molecules will be consid¬ 
ered in Chapter 18. 
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2.4 CELLS CAN RESPOND TO CHANCES 
IN THEIR ENVIRONIVIENTS 

The environments in which cells grow often change rapidly. For example, 
cells may consume all of a particular food source and must utilize others. 
To survive in a changing world, cells evolved mechanisms for adjusting their 
biochemistry in response to signals indicating environmental change. The 
adjustments can take many forms, including changes in the activities of pre- 
existing enzyme molecules, changes in the rates of synthesis of new enzyme 
molecules, and changes in membrane-transport processes. 

Initially, the detection of environmental signals occurred inside cells. 
Chemicals that could pass into cells, either by diffusion through the cell 
membrane or by the action of transport proteins, and could bind directly to 
proteins inside the cell and modulate their activities. An example is the use 
of the sugar arabinose by the bacterium Escherichia coli (Figure 2.19). E. 
coli cells are normally unable to use arabinose efficiently as a source of 



FIGURE 2.19 Responding to environmental condítions. In £ coli cells, the uptake 
of arabinose from the environment triggers the production of enzymes necessary for its 
utilization. 
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energy. However, if arabinose is their only source of carbon, E. coli cells 
synthesize enzymes that catalyze the conversion of this sugar into useful 
forms. This response is mediated by arabinose itself. If present in sufficient 
quantity outside the cell, arabinose can enter the cell through transport pro- 
teins. Once inside the cell, arabinose binds to a protein called AraC. This 
binding alters the structure of AraC so that it can now bind to specific sites 
in the bacterial DNA and increase RNA transcription from genes encoding 
enzymes that metabolize arabinose. The mechanisms of gene regulation will 
be considered in Chapter 31. 

Subsequently, mechanisms appeared for detecting signals at the cell sur- 
face. Cells could thus respond to signaling molecules even if those mole- 
cules did not pass into the cell. Receptor proteins evolved that, embedded 
in the membrane, could bind Chemicals present in the cellular environment. 
Binding produced changes in the protein structure that could be detected 
at the inside surface of the cell membrane. By this means, Chemicals out¬ 
side the cell could influence events inside the cell. Many of these signal- 
transduction pathways make use of substances such as cyclic adenosine 
monophosphate (cAMP) and calcium ion as ‘‘second messengers’' that can 
diffuse throughout the cell, spreading the word of environmental change. 
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FIGURE 2.20 Bactéria with flagella. 

A bacterium (Proteus mirabilis) swims 
through the rotation of filamentous 
structures called flagella. [Fred E. Hossier/ 
Visuais Uniimited.] 


The second messengers may bind to specific sensor proteins inside the cell 
and trigger responses such as the activation of enzymes. Signal-transduc- 
tion mechanisms will be considered in detail in Chapter 15 and in many 
other chapters throughout this book. 

2.4.1 Filamentous Structures and Molecular Motors Enable 
Intracellular and Cellular Movement 

The development of the ability to move was another important stage in the 
evolution of cells capable of adapting to a changing environment. Without 
this ability, nonphotosynthetic cells might have starved after consuming the 
nutrients available in their immediate vicinity. 

Bactéria swim through the use of filamentous structures termed flagella 
that extend from their cell membranes (Figure 2.20). Each bacterial cell has 
several flagella, which, under appropriate conditions, form rotating bundles 
that efficiently propel the cell through the water. These flagella are long 
polymers consisting primarily of thousands of identical protein subunits. 
At the base of each flagellum are assemblies of proteins that act as motors 
to drive its rotation. The rotation of the flagellar motor is driven by the flow 
of protons from outside to inside the cell. Thus, energy stored in the form 
of a proton gradient is transduced into another form, rotatory motion. 

Other mechanisms for motion, also depending on filamentous struc¬ 
tures, evolved in other cells. The most important of these structures are mi- 
crofilaments and microtubules. Microfilaments are polymers of the protein 




actin, and microtubules are polymers of two closely related proteins termed 
a- and ^-tubulin. Unlike a bacterial flagellum, these filamentous structures 
are highly dynamic: they can rapidly increase or decrease in length through 
the addition or subtraction of component protein molecules. Microfila- 
ments and microtubules also serve as tracks on which other proteins move, 
driven by the hydrolysis of ATP. Cells can change shape through the mo- 
tion of molecular motor proteins along such filamentous structures that are 
changing in shape as a result of dynamic polymerization (Figure 2.21). Co- 
ordinated shape changes can be a means of moving a cell across a surface 
and are crucial to cell division. The motor proteins are also responsible for 
the transport of organelles and other structures within eukaryotic cells. Mo¬ 
lecular motors will be considered in Chapter 34. 
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2.4.2 Some Cells Can Interact to Form Colonies 

with Spedalized Functions figure 2.21 Altemative movement. 

Cell mobility can be achieved by changes 

Early organisms lived exclusively as single cells. Such organisms interacted in cell shape. 

with one another only indirectly by competing for resources in their envi- 

ronments. Certain of these organisms, however, developed the ability to 

form colonies comprising many interacting cells. In such groups, the envi- 

ronment of a cell is dominated by the presence of surrounding cells, which 

may be in direct contact with one another. These cells communicate with 

one another by a variety of signaling mechanisms and may respond to sig- 

nals by altering enzyme activity or leveis of gene expression. One result may 

he cell differentiation; differentiated cells are genetically identical but have 

different properties because their genes are expressed differently. 

Several modern organisms are able to switch back and forth from exis- 
tence as independent single cells to existence as multicellular colonies of dif¬ 
ferentiated cells. One of the most well characterized is the slime mold Dic- 
tyostelium. In favorable environments, this organism lives as individual cells; 
under conditions of starvation, however, the cells come together to form a 
cell aggregate. This aggregate, sometimes called a slug, can move as a unit 
to a potentially more favorable environment where it then forms a multi¬ 
cellular structure, termed a fruiting body, that rises substantially above the 
surface on which the cells are growing. Wind may carry cells released from 
the top of the fruiting body to sites where the food supply is more plenti- 
ful. On arriving in a well-stocked location, the cells grow, reproduce, and 
live as individual cells until the food supply is again exhausted (Figure 2.22). 

The transition from unicellular to multicellular growth is triggered by 
cell-cell communication and reveals much about signaling processes be- 
tween and within cells. Under starvation conditions, Dictyostelium cells re- 
lease the signal molecule cyclic AMR This molecule signals surrounding 
cells by binding to a membrane-bound protein receptor on the cell surface. 


FIGURE 2.22 Unicellular to multicellular transition in Dictyostelium. This 
scanning electron migrograph shows the transformation undergone by the slime 
mold Dictyostelium. Hundreds of thousands of single cells aggregate to form a 
migrating slug, seen in the lower left. Once the slug comes to a stop, it gradually 
elongates to form the fruiting body. [Courtesy of M. J. Grimsom and R. L. Blanton, Texas 
Tech University.] 
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FIGURE 2.23 Intracellular sígnalíng. Cyclic AMP, detected by cell-surface receptors, 
initiates the formation of aggregates in Dictyostelium. 



FIGURE 2.24 Cell differentíatíon ín 
Dictyostelium. The colors represent the 
distribution of cell types expressing similar 
sets of genes in the Dictyostelium fruiting 
body. 



FIGURE 2.25 The nematode Caenor- 
habditis elegans. This organism sen/es 
as a usefui model for development. 
[Sinclair Stammers Science Photo Library/Photo 
Researchers.] 


The binding of cAMP molecules to these receptors triggers several re¬ 
sponses, including movement in the direction of higher cAMP concentra- 
tion, as well as the generation and release of additional cAMP molecules 
(Figure 2.23). 

The cells aggregate by following cAMP gradients. Once in contact, they 
exchange additional signals and then differentiate into distinct cell types, 
each of which expresses the set of genes appropriate for its eventual role in 
forming the fruiting body (Figure 2.24). The life cycles of organisms such 
as Dictyostelium foreshadow the evolution of organisms that are multicellu- 
lar throughout their lifetimes. It is also interesting to note the cAMP sig¬ 
nals starvation in many organisms, including human beings. 

2.4.3 The Development of Multicellular Organisms Requires 
the Orchestrated Differentiation of Cells 

The fóssil record indicates that macroscopic, multicellular organisms ap- 
peared approximately 600 million years ago. Most of the organisms famil¬ 
iar to us consist of many cells. For example, an adult human being contains 
approximately 100,000,000,000,000 cells. The cells that make up different 
organs are distinct and, even within one organ, many different cell types are 
present. Nonetheless, the DNA sequence in each cell is identical. The dif- 
ferences between cell types are the result of differences in how these genes 
are expressed. 

Each multicellular organism begins as a single cell. For this cell to de- 
velop into a complex organism, the embryonic cells must follow an intricate 
program of regulated gene expression, cell division, and cell movement. The 
developmental program relies substantially on the responses of cells to the 
environment created by neighboring cells. Cells in specific positions within 
the developing embryo divide to form particular tissues, such as muscle. 
Developmental pathways have been extensively studied in a number of or¬ 
ganisms, including the nematode Caenorhabditis elegans (Figure 2.25), a 
1-mm-long worm containing 959 cells. A detailed map describing the fate 
of each cell in C. elegans from the fertilized egg to the adult is shown in 
Figure 2.26. Interestingly, proper development requires not only cell divi¬ 
sion but also the death of specific cells at particular points in time through 
a process called programmed cell death or apoptosis. 

Investigations of genes and proteins that control development in a wide 
range of organisms have revealed a great many common features. Many of 
the molecules that control human development are evolutionarily related to 
those in relatively simple organisms such as C. elegans. Thus, Solutions to 
the problem of controlling development in multicellular organisms arose 
early in evolution and have been adapted many times in the course of evo¬ 
lution, generating the great diversity of complex organisms. 

























Zygote (one cell) 



2.4.4 The Unity of Biochemistry Allows Human Biology to Be 
Effectively Probed Through Studies of Other Organisms 

All organisms on Earth have a common origin (Figure 2.27). How could 
complex organisms such as human beings have evolved from the simple or¬ 
ganisms that existed at life s start? The path outlined in this chapter reveals 
that most of the fundamental processes of biochemistry were largely fixed 
early in the history of life. The complexity of organisms such as human be¬ 
ings is manifest, at a biochemical levei, in the interactions between over- 
lapping and competing pathways, which lead to the generation of intricately 
connected groups of specialized cells. The evolution of biochemical and 
physiological complexity is made possible by the effects of gene duplication 
followed by specialization. Paradoxically, the reliance on gene duplication 
also makes this complexity easier to comprehend. Consider, for example, 
the protein kinases—enzymes that transfer phosphoryl groups from ATP 
to specific amino acids in proteins. These enzymes play essential roles in 
many signal-transduction pathways and in the control of cell growth and 
differentiation. The human genome encodes approximately 500 proteins of 
this class; even a relatively simple, unicellular organism such as brewer^s 
yeast has more than 100 protein kinases. Yet each of these enzymes is the 
evolutionary descendant of a common ancestral enzyme. Thus, we can learn 
much about the essential behavior of this large collection of proteins through 
studies of a single family member. After the essential behavior is understood, 
we can evaluate the specific adaptations that allow each family member to 
perform its particular biological functions. 

Most central processes in biology have been characterized first in rela¬ 
tively simple organisms, often through a combination of genetic, physio¬ 
logical, and biochemical studies. Many of the processes controlling early 


FIGURE 2.26 Developmental pathways 
of C elegans. The nematode develops 
from a single cell, called a zygote, into 
a complex organism. The fate of each 
individual cell in C elegans is known 
and can be followed by referring to the 
cell-lineage diagram. The labeis indicate 
cells that form specific organs. Cells that 
undergo programmed cell death are shown 
in red. 
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FIGURE 2.27 A possible time líne for 
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indicated. 
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embryonic development were elucidated by the results of studies of the fruit 
• Biochemical Evolution fly. The events controlling DNA replication and the cell cycle were first de- 

ciphered in yeast. Investigators can now test the functions of particular pro- 
teins in mammals by disrupting the genes that encode these proteins in mice 
and examining the effects. The investigations of organisms linked to us by 
common evolutionary pathways are powerful tools for exploring all of bi- 
ology and for developing new understanding of normal human function and 
disease. 


SUMMARY 


O Key Organic Molecules Are Used by Living Systems 

The evolution of life required a series of transitions, beginning with the 
generation of organic molecules that could serve as the building blocks 
for complex biomolecules. How these molecules arose is a matter of con- 
jecture, but experiments have established that they could have formed 
under hypothesized prebiotic conditions. 

Evolution Requires Reproduction, Variation, and Selective Pressure 

The next major transition in the evolution of life was the formation of 
replicating molecules. Replication, coupled with variation and selective 
pressure, marked the beginning of evolution. Variation was introduced 
by a number of means, from simple base substitutions to the duplica- 
tion of entire genes. RN A appears to have been an early replicating mol- 
ecule. Furthermore, some RNA molecules possess catalytic activity. 
However, the range of reactions that RNA is capable of catalyzing is 
limited. With time, the catalytic activity was transferred to proteins— 
linear polymers of the chemically versatile amino acids. RNA directed 
the synthesis of these proteins and still does in modern organisms 
through the development of a genetic code, which relates base sequence 
to amino acid sequence. Eventually, RNA lost its role as the gene to the 
chemically similar but more stable nucleic acid DNA. In modern or¬ 
ganisms, RNA still serves as the link between DNA and protein. 

Energy Transformations Are Necessary to Sustain Living Systems 

Another major transition in evolution was the ability to transform en- 
vironmental energy into forms capable of being used by living systems. 
ATP serves as the cellular energy currency that links energy-yielding 
reactions with energy-requiring reactions. ATP itself is a product of the 
oxidation of fuel molecules, such as amino acids and sugars. With the 
evolution of membranes—hydrophobic barriers that delineate the bor- 
ders of cells—ion gradients were required to prevent osmotic crises. 
These gradients were formed at the expense of ATP hydrolysis. Later, 
ion gradients generated by light or the oxidation of fuel molecules were 
used to synthesize ATP. 

Cells Can Respond to Changes in Their Environments 

The final transition was the evolution of sensing and signaling mecha - 
nisms that enabled a cell to respond to changes in its environment. These 
signaling mechanisms eventually led to cell-cell communication, which 
allowed the development of more-complex organisms. The record of 
much of what has occurred since the formation of primitive organisms 
is written in the genomes of extant organisms. Knowledge of these 
genomes and the mechanisms of evolution will enhance our under¬ 
standing of the history of life on Earth as well as our understanding of 
existing organisms. 
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-1 KEY TERMS 

prebiotic world (p. 20) 
reproduction (p. 21) 
variation (p. 22) 
competition (p. 22) 
selective pressure (p. 22) 
catalyst (p. 23) 
enzyme (p. 23) 
ribozyme (p. 23) 


RN A world (p. 23) 
proteins (p. 24) 
genetic code (p. 25) 
translation (p. 25) 
gene (p. 25) 
mutation (p. 25) 
gene duplication (p. 26) 

ATP (adenosine triphosphate) (p. 28) 


membrane (p. 29) 

ion pump (p. 31) 

ion gradient (p. 31) 

photosynthesis (p. 31) 

signal transduction pathway (p. 34) 

molecular motor protein (p. 35) 

cell differentiation (p. 35) 

unity of biochemistry (p. 37) 
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PROBLEMS 


1. Finding the fragments. Identify the likely source (CH4, NH3, 
H 2 O, or H 2 ) of each atom in alanine generated in the Miller- 
Urey experiment. 

2. Following the populations. In an experiment analogous to the 
Spiegelman experiment, suppose that a population of RNA 


molecules consists of 99 identical molecules, each of which 
replicates once in 15 minutes, and 1 molecule that replicates 
once in 5 minutes. Estimate the composition of the population 
after 1, 10, and 25 “generations” if a generation is defined as 15 
minutes of replication. Assume that all necessary components 
are readily available. 
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3. Selective advantage. Suppose that a replicating RNA mole- 
cule has a mutation (genotypic change) and the phenotypic re- 
sult is that it binds nucleotide monomers more tightly than do 
other RNA molecules in its population. What might the selec¬ 
tive advantage of this mutation be? Under what conditions 
would you expect this selective advantage to be most important? 

4. Opposite of randomness. lon gradients prevent osmotic crises, 
but they require energy to be produced. Why does the forma- 
tion of a gradient require an energy input? 

5. Coupled gradients. How could a proton gradient with a higher 
concentration of protons inside a cell be used to pump ions out 
of a cell? 

6. Proton counting. Consider the reactions that take place across 
a photosynthetic membrane. On one side of the membrane, the 
foliowing reaction takes place: 

4 e~ + 4 A~ + 4 H 2 O-> 4 AH + 4 OH~ 


Need extra help? Purchase chapters of the Student Com- 
panion with complete Solutions online at www.whfreeman.com/ 


biochemS. 


whereas, on the other side of the membrane, the reaction is: 

2 H 2 O-> O 2 + 4 e +4 

How many protons are made available to drive ATP synthesis 
for each reaction cycle? 

7. An alternative pathway. To respond to the availability of sug- 
ars such as arabinose, a cell must have at least two types of pro- 
teins: a transport protein to allow the arabinose to enter the cell 
and a gene-control protein, which binds the arabinose and mod- 
ifies gene expression. To respond to the availability of some very 
hydrophobic molecules, a cell requires only one protein. Which 
one and why? 

8. How many divisions? In the development pathway of C. ele- 
gans, cell division is initially synchronous—that is, all cells di¬ 
vide at the same rate. Later in development, some cells divide 
more frequently than do others. How many times does each cell 
divide in the synchronous period? Refer to Figure 2.26. 






Protein Structure and Function 




f 

^ ^ 
* ' f> 


Crystais of human insulín. Insulin is a protein hormone, crucial for 
maintaining blood sugar at appropriate leveis. (Below) Chains of amino 
acids in a specific sequence (the primary structure) define a protein 
like insulin. These chains fold into well-defined structures (the tertiary 
structure)-in this case a single insulin molecule. Such structures 
assemble with other chains to form arrays such as the complex of six 
insulin molecules shown at the far right (the quarternary structure). 
These arrays can often be induced to form well-defined crystais (photo 
at left), which allows determination of these structures in detail. 

[(Left) Alfred Pasieka/Peter Arnold.] 
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Proteins are the most versatile macromolecules in living systems and serve 
crucial functions in essentially all biological processes. They function as cat- 
alysts, they transport and store other molecules such as oxygen, they pro- 
vide mechanical support and immune protection, they gen- 
erate movement, they transmit nerve impulses, and they 
control growth and differentiation. Indeed, much of this 
text will focus on understanding what proteins do and how 
they perform these functions. 

Several key properties enable proteins to participate in 
such a wide range of functions. 


1. Proteins are linear polymers huilt of monomer units called 
amino acids. The construction of a vast array of macromol¬ 
ecules from a limited number of monomer building blocks 
is a recurring theme in biochemistry. Does protein func¬ 
tion depend on the linear sequence of amino acids? The 
function of a protein is directly dependent on its three- 
dimensional structure (Figure 3.1). Remarkably, proteins 
spontaneously fold up into three- dimensional structures 
that are determined by the sequence of amino acids in the 
protein polymer. Thus, proteins are the emhodiment of the 
transition from the one-dimensional world of sequences to the 
three-dimensional world of molecules capable of diverse 
activities. 

2. Proteins contain a wide range of functional groups. These 
functional groups include alcohols, thiols, thioethers, carboxylic 
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FIGURE 3.1 Structure dktates function. A protein component of the DNA 

replication machinery surrounds a section of DNA double helix. The structure of the protein 
allows large segments of DNA to be copied without the replication machinery dissociating 


from the DNA. 



FIGURE 3.2 A complex protein 
assembly. An electron micrograph of 
insect flight tissue in cross section shows 
a hexagonal array of two kinds of protein 
filaments. [Courtesy of Dr. Michael Reedy] 


acids, carboxamides, and a variety of basic groups. When combined in vari- 
ous sequences, this array of functional groups accounts for the broad spec- 
trum of protein function. For instance, the chemical reactivity associated with 
these groups is essential to the function of enzymes, the proteins that catalyze 
specific Chemical reactions in biological systems (see Chapters 8-10). 

3. Proteins can interact with one another and with other biological macro- 
molecules to form complex assemblies. The proteins within these assemblies 
can act synergistically to generate capabilities not afforded by the individ¬ 
ual component proteins (Figure 3.2). These assemblies include macro- 
molecular machines that carry out the accurate replication of DNA, the 
transmission of signals within cells, and many other essential processes. 

4. Some proteins are quite rigid, whereas others display limited flexibility. 
Rigid units can function as structural elements in the cytoskeleton (the in¬ 
ternai scaffolding within cells) or in connective tissue. Parts of proteins with 
limited flexibility may act as hinges, springs, and levers that are crucial to 
protein function, to the assembly of proteins with one another and with 
other molecules into complex units, and to the transmission of information 
within and between cells (Figure 3.3). 


FIGURE 3.3 Flexibility and 
function. Upon binding iron, the protein 
lactoferrin undergoes conformational 
changes that allow other molecules to 
distinguish between the iron-free and the 
iron-bound forms. 







3.1 PROTEINS ARE BUILT FROM A REPERTOIRE 
OF 20 AMINO ACIDS 


A Repertoire of 20 Amino Acids 


Amino acids are the building blocks of proteins. An a-amino acid consists 
of a central carbon atom, called the a carbon, linked to an amino group, a 
carboxylic acid group, a hydrogen atom, and a distinctive R group. The R 
group is often referred to as the side chain. With four different groups con- 
nected to the tetrahedral a-carbon atom, a-amino acids are chiral; the two 
mirror-image forms are called the L isomer and the D isomer (Figure 3.4). 




FIGURE 3.4 The l and d isomers of amino acids. R refers to the side chain. 

The L and d isomers are mirror images of each other. 

Only L amino acids are constituents of proteins. For almost all amino acids, 
the L isomer has S (rather than R) absolute configuration (Figure 3.5). Al- 
though considerable effort has gone into understanding why amino acids in 
proteins have this absolute configuration, no satisfactory explanation has 
been arrived at. It seems plausible that the selection of L over D was arbi- 
trary but, once made, was fixed early in evolutionary history. 

Amino acids in solution at neutral pH exist predominantly as dipolar 
ions (also called zwitterions). In the dipolar form, the amino group is proton- 
ated (-NHa^) and the carboxyl group is deprotonated (-COO~). The ion- 
ization State of an amino acid varies with pH (Figure 3.6). In acid solution 
(e.g., pH 1), the amino group is protonated (-NH 3 ^) and the carboxyl group 
is not dissociated (-COOH). As the pH is raised, the carboxylic acid is the 
first group to give up a proton, inasmuch as its is near 2. The dipolar 
form persists until the pH approaches 9, when the protonated amino group 




I- 

Notation for distinguishing stereoisomers— 

The four different substituents of an 
asymmetric carbon atom are assigned 
a priority according to atomic number. 
The lowest-priority substituent, often 
hydrogen, is pointed away from the 
viewer. The configuration about the 
carbon is called S, from the Latin sinis- 
ter for "left," if the progression from 
the highest to the lowest priority is 
counterciockwise. The configuration is 
called R, from the Latin rectus for 
"right," if the progression is clockwise. 



FIGURE 3.5 Only l amino acids are 
found in proteins. Almost all l amino 
acids have an S absolute configuration 
(from the Latin sinister meaning "left")- 
The counterciockwise direction of the 
arrow from highest- to lowest-priority 
substituents indicates that the chiral 
center is of the S configuration. 


FIGURE 3.6 lonization State as a 
function of pH. The ionization State of 
amino acids is altered by a change in pH. 
The zwitterionic form predominates near 
physiological pH. 
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Glycine 
(Gly, G) 


Alanine 
(Ala, A) 


FIGURE 3.7 Structures of glycine and 
alanine. (Top) Ball-and-stick models 
Show the arrangement of atoms and 
bonds in space. (Middie) Stereochemically 
realistic formulas show the geometrical 
arrangement of bonds around atoms (see 
Chapter 1 Appendix). (Bottom) Fischer 
projections show all bonds as being 
perpendicular for a simplified 
representation (see Chapter 1 Appendix). 
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FIGURE 3.8 Amino acids with aliphatic side chains. The additional chiral center 
of isoleucine is indicated by an asterisk. 





loses a proton. For a review of acid-base concepts and pH, see the appendix 
to this chapter. 

Twenty kinds of side chains varying in size, shape, charge, hydrogen- 
bonding capacity, hydrophobic character, and chemical reactivity are com- 
monly found in proteins. Indeed, all proteins in all species—bacterial, ar- 
chaeal, and eukaryotic—are constructed from the same set of 20 amino 
acids. This fundamental alphabet of proteins is several billion years old. The 
remarkable range of functions mediated by proteins results from the diver- 
sity and versatility of these 20 building blocks. Understanding how this al¬ 
phabet is used to create the intricate three-dimensional structures that en- 
able proteins to carry out so many biological processes is an exciting area of 
biochemistry and one that we will return to in Section 3.6. 

Let us look at this set of amino acids. The simplest one is glycine, which 
has just a hydrogen atom as its side chain. With two hydrogen atoms 
bonded to the a-carbon atom, glycine is unique in being achiral. Alanine, 
the next simplest amino acid, has a methyl group (-CH 3 ) as its side chain 
(Figure 3.7). 

Larger hydrocarbon side chains are found in valine, leucine, and 
isoleucine (Figure 3.8). Methionine contains a largely aliphatic side chain that 
includes a thioether (-S-) group. The side chain of isoleucine includes an 
additional chiral center; only the isomer shown in Figure 3.8 is found in 
proteins. The larger aliphatic side chains are hydrophobic —that is, they tend 
to cluster together rather than contact water. The three-dimensional struc¬ 
tures of water-soluble proteins are stabilized by this tendency of hy¬ 
drophobic groups to come together, called the hydrophobic effect (see Sec¬ 
tion 1.3.3). The different sizes and shapes of these hydrocarbon side chains 
enable them to pack together to form compact structures with few holes. 
Proline also has an aliphatic side chain, but it differs from other members 
of the set of 20 in that its side chain is bonded to both the nitrogen and the 
a-carbon atoms (Figure 3.9). Proline markedly influences protein architec- 
ture because its ring structure makes it more conformationally restricted 
than the other amino acids. 



(Pro, P) 


Three amino acids with relatively simple aromatic side chains are part of 
the fundamental repertoire (Figure 3.10). Phenylalanine, as its name indi- 
cates, contains a phenyl ring attached in place of one of the hydrogens of 
alanine. The aromatic ring of tyrosine contains a hydroxyl group. This hy- 
droxyl group is reactive, in contrast with the rather inert side chains of the 
other amino acids discussed thus far. Tryptophan has an indole ring joined 
to a methylene (—CH 2 —) group; the indole group comprises two fused rings 
and an NH group. Phenylalanine is purely hydrophobic, whereas tyrosine 
and tryptophan are less so because of their hydroxyl and NH groups. The 
aromatic rings of tryptophan and tyrosine contain delocalized tt electrons 
that strongly absorb ultraviolet light (Figure 3.11). 

A compound s extinction coefficient indicates its ability to absorb light. 
Beer s law gives the absorbance (A) of light at a given wavelength: 
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FIGURE 3.9 Cyclk structure of proline. 

The side chain is joined to both the a 
carbon and the amino group. 
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Phenylalanine 
(Phe, F) 


Tyrosine 

(Tyr,Y) 


Tryptophan 
(Trp, W) 


FIGURE 3.10 Amíno acíds wíth 
aromatic síde chaíns. Phenylalanine, 
tyrosine, and tryptophan have hydrophobic 
character. Tyrosine and tryptophan aiso 
have hydrophilic properties because of 
their -OH and -NH- groups, respectively. 
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A = ecl Beer’s law 

where e is the extinction coefficient [in units that are the 
reciprocais of molarity and distance in centimeters (M~^ 
cm~^)], c is the concentration of the absorbing species (in 
units of molarity, M), and I is the length through which 
the light passes (in units of centimeters). For tryptophan, 
absorption is maximum at 280 nm and the extinction 
coefficient is 3400 cm~^ whereas, for tyrosine, ab¬ 
sorption is maximum at 276 nm and the extinction coef¬ 
ficient is a less-intense 1400 cm~h Phenylalanine 
absorbs light less strongly and at shorter wavelengths. 
The absorption of light at 280 nm can be used to estimate 


FIGURE 3.11 Absorption spectra of the aromatic amino acids 
tryptophan (red) and tyrosine (blue). Oniy these amino acids 
absorb strongly near 280 nm. [Courtesy of Greg Gatto]. 
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the concentration of a protein in solution if the number of tryptophan and 
tyrosine residues in the protein is known. 

Two amino acids, serine and threonine, contain aliphatic hydroxyl groups 
(Figure 3.12). Serine can be thought of as a hydroxylated version of alanine, 
whereas threonine resembles valine with a hydroxyl group in place of one 
of the valine methyl groups. The hydroxyl groups on serine and threonine 
make them much more hydrophilic (water loving) and reactive than alanine 
and valine. Threonine, like isoleucine, contains an additional asymmetric 
center; again only one isomer is present in proteins. 

Cysteine is structurally similar to serine but contains a sulfhydryl, or thiol 
(-SH), group in place of the hydroxyl (-OH) group (Figure 3.13). The 
sulfhydryl group is much more reactive. Pairs of sulfhydryl groups may 
come together to form disulfide bonds, which are particularly important in 
stabilizing some proteins, as will be discussed shortly. 



FIGURE 3.13 Structure of cysteine. 


We turn now to amino acids with very polar side chains that render them 
highly hydrophilic. Lysine and arginine have relatively long side chains that 
terminate with groups that are positively charged at neutral pH. Lysine is 
capped by a primary amino group and arginine by a guanidinium group. 
Histidine contains an imidazole group, an aromatic ring that also can be pos¬ 
itively charged (Figure 3.14). 
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FIGURE 3.14 The bask amíno acíds 
lysine, argíníne, and hístidíne. 



FIGURE 3.15 Hístidíne ionization. 

Histidine can bind or release protons near 
physiological pH. 
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With a pKa value near 6, the imidazole group can be uncharged or posi- 
tively charged near neutral pH, depending on its local environment (Figure 
3.15). Indeed, histidine is often found in the active sites of enzymes, where 
the imidazole ring can bind and release protons in the course of enzymatic 
reactions. 

The set of amino acids also contains two with acidic side chains: aspar- 
tic acid and glutamic acid (Figure 3.16). These amino acids are often called 
aspartate and glutamate to emphasize that their side chains are usually neg- 
atively charged at physiological pH. Nonetheless, in some proteins these 
side chains do accept protons, and this ability is often functionally impor- 
tant. In addition, the set includes uncharged derivatives of aspartate and 
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FIGURE 3.16 Amíno acíds wíth síde-chaín carboxylates and carboxamides. 


glutamate —asparagine and glutamine —each of which contains a terminal 
carboxamide in place of a carboxylic acid (Figure 3.16). 

Seven of the 20 amino acids have readily ionizable side chains. These 
7 amino acids are able to donate or accept protons to facilitate reactions 
as well as to form ionic bonds. Table 3.1 gives equilibria and typical pfCa 
values for ionization of the side chains of tyrosine, cysteine, arginine, ly- 
sine, histidine, and aspartic and glutamic acids in proteins. Two other 
groups in proteins—the terminal n-amino group and the terminal a- 
carboxyl group—can be ionized, and typical pKa values are also included 
in Table 3.1. 

Amino acids are often designated by either a three-letter abbreviation or 
a one-letter symbol (Table 3.2). The abbreviations for amino acids are the 
first three letters of their names, except for asparagine (Asn), glutamine 
(Gin), isoleucine (Ile), and tryptophan (Trp). The symbols for many amino 
acids are the first letters of their names (e.g., G for glycine and L for leucine); 
the other symbols have been agreed on by convention. These abbreviations 
and symbols are an integral part of the vocabulary of biochemists. 

Y How did this particular set of amino acids become the building blocks 
of proteins? First, as a set, they are diverse; their structural and Chem¬ 
ical properties span a wide range, endowing proteins with the versatility to 
assume many functional roles. Second, as noted in Section 2.1.1, many of 
these amino acids were probably available from prebiotic reactions. Finally, 
excessive intrinsic reactivity may have eliminated other possible amino 
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TABLE 3.1 Typical pK^ values of ionizable groups in proteins 


Group 


Terminal a-carboxyl group 


Aspartic acid 
Glutamic acid 
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Terminal a-amino group 
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Lysine 
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— S 
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+ JN-H 
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Base Typical pKa 
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li - 
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li - 
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-n.,h 
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n^,h 

H 

H 

\ 

N 

,N-H 

H 


H 


3.1 

4.1 

6.0 

8.0 

8.3 

10.9 

10.8 

12.5 


*pKa values depend on temperature, ionic strength, and the microenvironment of the 
ionizable group. 


TABLE 3.2 Abbreviations for amino acids 


Amino acid 

Three-letter 

abbreviation 

One-letter 

abbreviation 

Amino acid 

Three-letter 

abbreviation 

One-letter 

abbreviation 

Alanine 

Ala 

A 

Methionine 

Met 

M 

Arginine 

Arg 

R 

Phenylalanine 

Phe 

F 

Asparagine 

Asn 

N 

Proline 

Pro 

P 

Aspartic Acid 

Asp 

D 

Serine 

Ser 

S 

Cysteine 

Cys 

C 

Threonine 

Thr 

T 

Glutamine 

Gin 

Q 

Tryptophan 

Trp 

W 

Glutamic Acid 

Glu 

E 

Tyrosine 

Tyr 

Y 

Glycine 

Gly 

G 

Valine 

Vai 

V 

Histidine 

His 

H 

Asparagine or 

Asx 

B 

Isoleucine 

Ile 

I 

aspartic acid 



Leucine 

Leu 

L 

Glutamine or 

Glx 

Z 

Lysine 

Lys 

K 

glutamic acid 




































acids. For example, amino acids such as homoserine and homocysteine tend 
to form five-membered cyclic forms that limit their use in proteins; the al¬ 
ternativo amino acids that are found in proteins—serine and cysteine—do 
not readily cyclize, because the rings in their cyclic forms are too small 
(Figure 3.17). 
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3.2 PRIIVIARY STRUCTURE; AMINO ACIDS ARE LINKED 
BY PEPTIDE BONDS TO FORM POLYPEPTIDE CHAINS 

Proteins are linear polymers formed by linking the n-carboxyl group of one 
amino acid to the o:-amino group of another amino acid with a peptide bond 
(also called an amide bond). The formation of a dipeptide from two amino acids 
is accompanied by the loss of a water molecule (Figure 3.18). The equilibrium 
of this reaction lies on the side of hydrolysis rather than synthesis. Hence, 
the biosynthesis of peptide bonds requires an input of free energy. Nonethe- 
less, peptide bonds are quite stable kinetically; the lifetime of a peptide bond 
in aqueous solution in the absence of a catalyst approaches 1000 years. 


H H 

/P + /P 

+H3N^ +H3N^ 

^ I! - ^ II " 

O o 



H 20 


Peptide bond 


A series of amino acids joined by peptide bonds form ã polypeptide chain, 
and each amino acid unit in a polypeptide is called a residue. A polypeptide 
chain has polarity because its ends are different, with an n-amino group at 
one end and an n-carboxyl group at the other. By convention, the amino end 
is taken to be the beginning of a polypeptide chain, and so the sequence of 
amino acids in a polypeptide chain is written starting with the amino- 
terminal residue. Thus, in the pentapeptide Tyr-Gly-Gly-Phe-Leu (YGGFL), 
phenylalanine is the amino-terminal (N-terminal) residue and leucine is the 
carboxyl-terminal (C-terminal) residue (Figure 3.19). Leu-Phe-Gly-Gly-Tyr 
(LFGGY) is a different pentapeptide, with different chemical properties. 

A polypeptide chain consists of a regularly repeating part, called the main 
chain or backbone, and a variable part, comprising the distinctive side chains 
(Figure 3.20). The polypeptide backbone is rich in hydrogen-bonding po- 
tential. Each residue contains a carbonyl group, which is a good hydrogen- 
bond acceptor and, with the exception of proline, an NH group, which is a 
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FIGURE 3.17 Undesírable reactívíty in 
amino acids. Some amino acids are 
unsuitable for proteins because of 
undesirable cyclization. Homoserine can 
cyclize to form a stable, five-membered 
ring, potentially resulting in peptide-bond 
cleavage. Cyclization of serine would form 
a strained, four-membered ring and thus 
is unfavored. X can be an amino group 
from a neighboring amino acid or another 
potential leaving group. 


FIGURE 3.18 Peptide-bond formation. 

The linking of two amino acids is 
accompanied by the loss of a molecule 
of water. 
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FIGURE 3.19 Amíno acíd sequences 
have dírectíon. This illustration of the 
pentapeptide Try-Gly-Gly-Phe-Leu (YGGFL) 
shows the sequence from the amino 
terminus to the carboxyl terminus. This 
pentapeptide, Leu-enkephalin, is an opioid 
peptide that modulates the perception of 
pain. The reverse pentapeptide, Leu-Phe- 
Gly-Gly-Tyr (LFGGY), is a different molecule 
and shows no such effects. 



Tyr Gly Gly Phe Leu 


Amino 
terminal residue 



Carboxyl 
terminal residue 


FIGURE 3.20 Components of a 
polypeptíde chaín. A polypeptide chain 
consists of a constant backbone (shown in 
black) and variable side chains (shown in 
green). 



I- 

Dalton- 

A unit of mass very nearly equal to that 
of a hydrogen atom. Named after John 
Dalton (1766-1844), who developed 
the atomic theory of matter. 


I- 

Kilodalton (kd)- 

A unit of mass equal to 1000 daltons. 


good hydrogen-bond donor. These groups interact with each other and with 
functional groups from side chains to stabilize particular structures, as will 
be discussed in detail. 

Most natural polypeptide chains contain between 50 and 2000 amino 
acid residues and are commonly referred to as proteins. Peptides made of 
small numbers of amino acids are called oligopeptides or simply peptides. The 
mean molecular weight of an amino acid residue is about 110, and so the 
molecular weights of most proteins are between 5500 and 220,000. We can 
also refer to the mass of a protein, which is expressed in units of daltons; 
one dalton is equal to one atomic mass unit. A protein with a molecular 
weight of 50,000 has a mass of 50,000 daltons, or 50 kd (kilodaltons). 

In some proteins, the linear polypeptide chain is cross-linked. The most 
common cross -links are disulfide bonds, formed by the oxidation of a pair of 
cysteine residues (Figure 3.21). The resulting unit of linked cysteines is 


FIGURE 3.21 Cross-línks. The formation 
of a disulfide bond from two cysteine 
residues is an oxidation reaction. 
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called cystine. Extracellular proteins often have several disulfide bonds, 
whereas intracellular proteins usually lack them. Rarely, nondisulfide cross- 
links derived from other side chains are present in some proteins. For ex- 
ample, collagen fibers in connective tissue are strengthened in this way, as 
are fibrin blood clots. 

3.2.1 Proteins Have Unique Amino Acid Sequences 
That Are Specified by Genes 

In 1953, Frederick Sanger determined the amino acid sequence of insulin, 
a protein hormone (Figure 3.22). This work is a landmark in biochemistry be- 
cause it showed for the first time that a protein has a precisely defined amino 
acid sequence. Moreover, it demonstrated that insulin consists only of L 
amino acids linked by peptide bonds between n-amino and n-carboxyl 
groups. This accomplishment stimulated other scientists to carry out se¬ 
quence studies of a wide variety of proteins. Indeed, the complete amino 
acid sequences of more than 100,000 proteins are now known. The striking 
fact is that each protein has a unique, precisely defined amino acid sequence. 
The amino acid sequence of a protein is often referred to as its primary 
structure. 

A series of incisive studies in the late 1950s and early 1960s revealed that 
the amino acid sequences of proteins are genetically determined. The se¬ 
quence of nucleotides in DNA, the molecule of heredity, specifies a com- 
plementary sequence of nucleotides in RN A, which in turn specifies the 
amino acid sequence of a protein. In particular, each of the 20 amino acids 
of the repertoire is encoded by one or more specific sequences of three nu¬ 
cleotides (Section 5.5). 

Knowing amino acid sequences is important for several reasons. First, 
knowledge of the sequence of a protein is usually essential to elucidating its 
mechanism of action (e.g., the catalytic mechanism of an enzyme). More¬ 
over, proteins with novel properties can be generated by varying the se¬ 
quence of known proteins. Second, amino acid sequences determine the 
three-dimensional structures of proteins. Amino acid sequence is the link 
between the genetic message in DNA and the three-dimensional structure 
that performs a protein^s biological function. Analyses of relations between 
amino acid sequences and three-dimensional structures of proteins are un- 
covering the rules that govern the folding of polypeptide chains. Third, se¬ 
quence determination is a component of molecular pathology, a rapidly 
growing area of medicine. Alterations in amino acid sequence can produce 
abnormal function and disease. Severe and sometimes fatal diseases, such 
as sickle-cell anemia and cystic fibrosis, can result from a change in a sin¬ 
gle amino acid within a protein. Fourth, the sequence of a protein reveals 
much about its evolutionary history (see Chapter 7). Proteins resemble one 
another in amino acid sequence only if they have a common ancestor. Con- 
sequently, molecular events in evolution can be traced from amino acid se¬ 
quences; molecular paleontology is a flourishing area of research. 
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Gly-lle-Val-Glu-Gin-Cys-Cys-Ala-Ser-Val-Cys-Ser-Leu-Tyr-Gin-Leu-Glu-Asn-Tyr-Cys-Asn 
5 I 10 15 I 21 




s 


/ 


B Chain 


FIGURE 3.22 Amino acid sequence 
of bovíne insulin. 


Phe-Val-Asn-Gin-His-Leu-Cys-Gly-Ser-His-Leu-Val-Glu-Ala-Leu-Tyr-Leu-Val-Cys-Gly-Glu-Arg-Gly-Phe-Phe-Tyr-Thr-Pro-Lys-Ala 
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FIGURE 3.23 Peptide bonds are planar. In a pair of línked 
amino acids, six atoms (Q, C, O, N, H, and CJ lie in a plane. 
Side chains are shown as green balis. 


3.2.2 Polypeptide Chains Are Flexible 
Yet Conformationally Restricted 

Examination of the geometry of the protein backbone re- 
veals several important features. First, the peptide bond is 
essentially planar (Figure 3.23). Thus, for a pair of amino 
acids linked by a peptide bond, six atoms lie in the same 
plane: the a-carbon atom and CO group from the first 
amino acid and the NH group and a-carbon atom from 
the second amino acid. The nature of the chemical bond- 
ing within a peptide explains this geometric preference. 
The peptide bond has considerable double-bond charac- 
ter, which prevents rotation about this bond. 

\/ ^ \/ ^ 

II l\ I l\ 

O I o- I 

Peptide bond resonance structures 


The inability of the bond to rotate constrains the con- 
formation of the peptide backbone and accounts for the 
bond s planarity. This double-bond character is also ex- 
pressed in the length of the bond between the CO and 
NH groups. The C-N distance in a peptide bond is typ- 
ically 1.32 A, which is between the values expected for 
a C-N single bond (1.49 Â) and a C=N double bond 
(1.27 A), as shown in Figure 3.24. Finally, the peptide 
bond is uncharged, allowing polymers of amino acids 
linked by peptide bonds to form tightly packed globular 
structures. 

Two configurations are possible for a planar peptide 
bond. In the trans configuration, the two a-carbon atoms 
are on opposite sides of the peptide bond. In the cis con¬ 
figuration, these groups are on the same side of the peptide bond. Almost 
all peptide bonds in proteins are trans. This preference for trans over cis can 
be explained by the fact that steric clashes between groups attached to the 
a-carbon atoms hinder formation of the cis form but do not occur in the trans 
configuration (Figure 3.25). By far the most common cis peptide bonds are 
X-Pro linkages. Such bonds show less preference for the trans configuration 
because the nitrogen of proline is bonded to two tetrahedral carbon atoms, 
limiting the steric differences between the trans and cis forms (Figure 3.26). 

In contrast with the peptide bond, the bonds between the amino group 
and the a-carbon atom and between the a-carbon atom and the carbonyl 
group are pure single bonds. The two adjacent rigid peptide units may rotate 
about these bonds, taking on various orientations. This freedom of rotation 



FIGURE 3.24 Typícal bond lengths wíthín a peptide unít. 

The peptide unit is shown in the trans configuration. 


FIGURE 3.25 Trans and cis peptide 
bonds. The trans form is strongly favored 
because of steric clashes that occur in the 
cis form. 
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FIGURE 3.26 Trans and cís X-Pro 
bonds. The energies of these forms are 
relatively balanced because steric clashes 
occur in both forms. 


about two bonds of each amino acid allows proteins to fold in many different 
ways. The rotations about these bonds can be specified by dihedral angles 
(Figure 3.27). The angle of rotation about the bond between the nitrogen 
and the n-carbon atoms is called phi ((/)). The angle of rotation about the 
bond between the n-carbon and the carbonyl carbon atoms is called psi (i//). 
A clockwise rotation about either bond as viewed from the front of the back 
group corresponds to a positive value. The c|) and \\f angles determine the 
path of the polypeptide chain. 

Are all combinations of (|) and i[i possible? G. N. Ramachandran recog- 
nized that many combinations are forbidden because of steric collisions be¬ 
tween atoms. The allowed values can be visualized on a two-dimensional 
plot called a Ramachandran diagram (Figure 3.28). Three-quarters of the 
possible ((|), v|i) combinations are excluded simply by local steric clashes. 
Steric exclusion, the fact that two atoms cannot be in the same place at the 
same time, can be a powerful organizing principie. 

The ability of biological polymers such as proteins to fold into well- 
defined structures is remarkable thermodynamically. Consider the equilib- 
rium between an unfolded polymer that exists as a random coil—that is, as 
a mixture of many possible conformations— and the folded form that adopts 
a unique conformation. The favor able entropy associated with the large 
number of conformations in the unfolded form opposes folding and must 
be overcome by interactions favoring the folded form. Thus, highly flexi- 
ble polymers with a large number of possible conformations do not fold into 
unique structures. The rigidity of the peptide unit and the restricted set of al¬ 
lowed 4> if/ angles limits the number of structures accessible to the unfolded 
form sufficiently to allow protein folding to occur. 


I- 

Dihedral angle— 

A measure of the rotation about a 
bond, usually taken to lie between 
-180° and +180°. Dihedral angles are 
sometimes called torsion angles. 



(C) 



\\f = +85° 


FIGURE 3.27 Rotation about bonds in a polypeptide. The structure of each amino 
acid in a polypeptide can be adjusted by rotation about two single bonds. (A) Phi (cj)) is the 
angle of rotation about the bond between the nitrogen and the a-carbon atoms, whereas 
psi (i|i) is the angle of rotation about the bond between the a-carbon and the carbonyl 
carbon atoms. (B) A view down the bond between the nitrogen and the a-carbon atoms, 
showing how c|) is measured. (C) A view down the bond between the a-carbon and the 
carbonyl carbon atoms, showing how i}; is measured. 
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FIGURE 3.28 A Ramachandran 
díagram showing the values of ct> and 

i|/. Not all c|) and i|i values are possible 
without coilisions between atoms. The 
most favorable regions are shown in dark 
green; borderline regions are shown in 
light green. The structure on the right is 
disfavored because of steric clashes. 




% STRUCTURAL INSIGHTS, appearing 
throughout the book, are molecular 
modeling-based tutoriais that enable you 
to review structure and learn what the 
latest research tells us about the workings 
of the molecule. To access, go to the 
Web site: www.whfreeman.com/biochem5, 
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and the titie. 


I- 

Screw sense— 

Describes the direction in which a heli- 
cal structure rotates with respect to its 
axis. If, viewed down the axis of a he- 
lix, the Chain turns in a clockwise direc¬ 
tion, it has a right-handed screw sense. 
If the turning is counterciockwise, the 
screw sense is left-handed. 


3.3 SECONDARY STRUCTURE; POLYPEPTIDE CHAINS 
CAN FOLD INTO REGULAR STRUCTURES SUCH AS 
THE ALPHA HELIX, THE BETA SHEET, AND TURNS 
AND LOOPS 

Can a polypeptide chain fold into a regularly repeating structure? In 1951, 
Linus Pauling and Robert Corey proposed two periodic structures called 
the a helix (alpha helix) and the j8 pleated sheet (beta pleated sheet). Subse- 
quently, other structures such as the j8 turn and omega (il) loop were iden- 
tified. Although not periodic, these common turn or loop structures are well 
defined and contribute with a helices and |3 sheets to form the final protein 
structure. 


STRUCTURAL INSIGHTS, Elements of Protein Structure provides interactive 
representations of some of the important elements of protein architecture described in 
this chapter, including a summary of secondary structure motifs. 

3.3.1 The Alpha Helix Is a Coiled Structure Stabilized 
by Intrachain Hydrogen Bonds 

In evaluating potential structures, Pauling and Corey considered which con- 
formations of peptides were sterically allowed and which most fully ex- 
ploited the hydrogen-bonding capacity of the backbone NH and CO 
groups. The first of their proposed structures, the a helix, is a rodlike struc¬ 
ture (Figure 3.29). A tightly coiled backbone forms the inner part of the rod 
and the side chains extend outward in a helical array. The a helix is stabi¬ 
lized by hydrogen bonds between the NH and CO groups of the main chain. 
In particular, the CO group of each amino acid forms a hydrogen bond with 
the NH group of the amino acid that is situated four residues ahead in the 
sequence (Figure 3.30). Thus, except for amino acids near the ends of an a 
helix, all the main-chain CO and NH groups are hydrogen bonded. Each 
residue is related to the next one by a rise of 1.5 À along the helix axis and 
a rotation of 100 degrees, which gives 3.6 amino acid residues per turn of 
helix. Thus, amino acids spaced three and four apart in the sequence are 
spatially quite dose to one another in an a helix. In contrast, amino acids 
two apart in the sequence are situated on opposite sides of the helix and so 
are unlikely to make contact. The pitch of the a helix, which is equal to the 
product of the translation (1.5 A) and the number of residues per turn (3.6), 
is 5.4 À. The screw sense of a helix can be right-handed (clockwise) or left- 
handed (counterciockwise). The Ramachandran diagram reveals that both 


































FIGURE 3.29 Structure of the OL helíx. (A) A ribbon depiction with the a-carbon atoms 
and side chains (green) shown. (B) A side view of a ball-and-stick version depicts the 
hydrogen bonds (dashed lines) between NH and CO groups. (C) An end view shows the 
coiled backbone as the inside of the helix and the side chains (green) projecting outward. 
(D) A space-filling view of part C shows the tightly packed interior core of the helix. 
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FIGURE 3.30 Hydrogen-bondíng 
scheme for an ol helíx. In the a helix, 
the CO group of residue n forms a 
hydrogen bond with the NH group of 
residue n + 4. 


the right-handed and the left-handed helices are among allowed conforma- 
tions (Figure 3.31). However, right-handed helices are energetically more 
favorable because there is less steric clash between the side chains and the 
backbone. Essentially all a helices found in proteins are right-handed. In 
schematic diagrams of proteins, a helices are depicted as twisted ribbons or 
rods (Figure 3.32). 

Pauling and Corey predicted the structure of the a helix 6 years before 
it was actually seen in the x-ray reconstruction of the structure of myoglo- 
bin. The elucidation of the structure of the a helix is a landmark in biochem- 
istry because it demonstrated that the conformation of a polypeptide chain can 
be predicted if the properties of its components are rigorously and precisely 
known. 

The a-helical content of proteins ranges widely, from nearly none to al- 
most 100%. For example, about 7 5% of the residues in ferritin, a protein that 
helps store iron, are in a helices (Figure 3.33). Single a helices are usually 
less than 45 A long. However, two or more a helices can entwine to form a 
very stable structure, which can have a length of 1000 À (100 nm, or 0.1 pcm) 
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FIGURE 3.31 Ramachandran díagram 
for helices. Both right- and left-handed 
helices lie in regions of allowed 
conformations in the Ramachandran 
diagram. However, essentially all a helices 
in proteins are right-handed. 
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FIGURE 3.32 Schematic víews of ol 
helíces. (A) A ball-and-stick model. 

(B) A ribbon depiction. (C) A cylindrical 
depiction. 




protein. Ferritin, an iron-storage protein, 
is built from a bundie of a helices. 


or more (Figure 3.34). Such a-helical coiled coils are found in myosin and 
tropomyosin in muscle, in fibrin in blood clots, and in keratin in hair. The 
helical cables in these proteins serve a mechanical role in forming stiff bun- 
dles of fibers, as in porcupine quills. The cytoskeleton (internai scaffolding) 
of cells is rich in so-called intermediate filaments, which also are two- 
stranded a-helical coiled coils. Many proteins that span biological mem- 
branes also contain a helices. 



20 A 


FIGURE 3.34 An a-helícal coiled coil. The two helices wind around one another 
to form a superhelix. Such structures are found in many proteins including keratin in hair, 
quills, claws, and horns. 



FIGURE 3.35 Ramachandran diagram 
for p strands. The red area shows the 
sterically allowed conformations of 
extended, p-strand-like structures. 


3.3.2 Beta Sheets Are Stabilized by Hydrogen Bonding 
Between Polypeptide Strands 

Pauling and Corey discovered another periodic structural motif, which they 
named the (3 pleated sheet (|3 because it was the second structure that they 
elucidated, the a helix having been the first). The |3 pleated sheet (or, more 
simply, the |3 sheet) differs markedly from the rodlike a helix. A polypep¬ 
tide chain, called a j8 strand, in a |3 sheet is almost fully extended rather than 
being tightly coiled as in the a helix. A range of extended structures are ster¬ 
ically allowed (Figure 3.35). 

The distance between adjacent amino acids along a |3 strand is approxi- 
mately 3.5 A, in contrast with a distance of 1.5 A along an a helix. The side 
chains of adjacent amino acids point in opposite directions (Figure 3.36). A 
(3 sheet is formed by linking two or more (3 strands by hydrogen bonds. Ad¬ 
jacent chains in a |3 sheet can run in opposite directions (antiparallel |3 sheet) 
or in the same direction (parallel |3 sheet). In the antiparallel arrangement, the 
NH group and the CO group of each amino acid are respectively hydrogen 
bonded to the CO group and the NH group of a partner on the adjacent chain 
(Figure 3.37). In the parallel arrangement, the hydrogen-bonding scheme is 
slightly more complicated. For each amino acid, the NH group is hydrogen 
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FIGURE 3.36 Structure of a p strand. 

The side chains (green) are alternately 
above and below the plane of the strand. 



FIGURE 3.37 An antiparallel p sheet. 

Adjacent p strands run in opposite 
directions. Hydrogen bonds between NH 
and CO groups connect each amino acid 
to a single amino acid on an adjacent 
strand, stabilizing the structure. 


bonded to the CO group of one amino acid on the adjacent strand, whereas 
the CO group is hydrogen bonded to the NH group on the amino acid two 
residues farther along the chain (Figure 3.38). Many strands, typically 4 or 5 
but as many as 10 or more, can come together in |3 sheets. Such |3 sheets can 
be purely antiparallel, purely parallel, or mixed (Figure 3.39). 

In schematic diagrams, |3 strands are usually depicted by broad arrows 
pointing in the direction of the carboxyl-terminal end to indicate the type 
of P sheet formed—parallel or antiparallel. More structurally diverse than 
a helices, P sheets can be relatively flat but most adopt a somewhat twisted 
shape (Figure 3.40). The P sheet is an important structural element in many 
proteins. For example, fatty acid-binding proteins, important for lipid me- 
tabolism, are built almost entirely from P sheets (Figure 3.41). 



FIGURE 3.38 A parallel p sheet. 

Adjacent p strands run in the same 
direction. Hydrogen bonds connect each 
amino acid on one strand with two 
different amino acids on the adjacent 
strand. 
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FIGURE 3.39 Structure of a mixed 
P sheet. 




(B) 


(C) 






FIGURE 3.40 A twísted p sheet. (A) A ball-and-stick model. (B) A schematic 
model. (C) The schematic view rotated by 90 degrees to illustrate the twist 
more clearly. 



FIGURE 3.41 A protein rich 
in p sheets. The structure of a fatty 
acid-binding protein. 


3.3.3 Polypeptide Chains Can Change Direction 
by Making Reverse Turns and Loops 

Most proteins have compact, globular shapes, requiring reversals in the di¬ 
rection of their polypeptide chains. Many of these reversals are accom- 
plished by a common structural element called the reverse turn (also known 
as the j8 turn or hairpin bend), illustrated in Figure 3.42. In many reverse 
turns, the CO group of residue i of a polypeptide is hydrogen bonded to the 
NH group of residue i + 3. This interaction stabilizes abrupt changes in di¬ 
rection of the polypeptide chain. In other cases, more elaborate structures 
are responsible for chain reversals. These structures are called loops or some- 
times Í2 loops (omega loops) to suggest their overall shape. Unlike a helices 
and P strands, loops do not have regular, periodic structures. Nonetheless, 
loop structures are often rigid and well defined (Figure 3.43). Turns and 
loops invariably lie on the surfaces of proteins and thus often participate in 








FIGURE 3.42 Structure of a reverse 
turn. The CO group of residue / of the 
polypeptide chain is hydrogen bonded to 
the NH group of residue / + 3 to stabilize 
the turn. 


interactions between proteins and other molecules. The distribution of a 
helices, |3 strands, and turns along a protein chain is often referred to as its 
secondary structure. 


3.4 TERTIARY STRUCTURE; WATER-SOLUBLE 
PROTEINS FOLD INTO COIVIPACT STRUCTURES 
WITH NONPOLAR CORES 



FIGURE 3.43 Loops on a protein 
surface. A part of an antibody molecule 
has surface loops (shown in red) that 
mediate interactions with other molecules. 


Let us now examine how amino acids are grouped together in a complete 
protein. X-ray crystallographic and nuclear magnetic resonance studies 
(Section 4.5) have revealed the detailed three- dimensional structures of 
thousands of proteins. We begin here with a preview of myoglobin, the first 
protein to be seen in atomic detail. 

Myoglobin, the oxygen carrier in muscle, is a single polypeptide chain of 
153 amino acids (see also Chapters 7 and 10). The capacity of myoglobin to 
bind oxygen depends on the presence of heme, a nonpolypeptide prosthetic 
(helper) group consisting of protoporphyrin IX and a central iron atom. Myo- 
globin is an extremely compact molecule. Its overall dimensions are 45 X 35 X 
2 5 A, an order of magnitude less than if it were fully stretched out (Figure 3.44). 



(B) 



% FIGURE 3.44 Three-dímensíonal structure of myoglobin. (A) This ball-and-stick 
model shows all nonhydrogen atoms and reveais many interactions between the amino 
acids. (B) A schematic view shows that the protein consists largely of a helices. The heme 
group is shown in black and the iron atom is shown as a purple sphere. 
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FIGURE 3.45 Dístributíon of 
amíno acíds ín myoglobín. (A) A 

space-filling model of myoglobin with 
hydrophobic amino adds shown in yellow, 
charged amino adds shown in blue, and 
others shown in white. The surface of the 
molecule has many charged amino acids, 
as well as some hydrophobic amino acids. 
(B) A cross-sectional view shows that 
mostiy hydrophobic amino acids are found 
on the inside of the structure, whereas 
the charged amino acids are found on the 
protein surface. 



About 70% of the main chain is folded into eight a helices, and much of the 
rest of the chain forms turns and loops between helices. 

The folding of the main chain of myoglobin, like that of most other pro- 
teins, is complex and devoid of symmetry. The overall course of the polypep- 
tide chain of a protein is referred to as its tertiary structure. A unifying prin¬ 
cipie emerges from the distribution of side chains. The striking fact is that 
the interior consists almost entirely of nonpolar residues such as leucine, va- 
line, methionine, and phenylalanine (Figure 3.45). Charged residues such 
as aspartate, glutamate, lysine, and arginine are absent from the inside of 
myoglobin. The only polar residues inside are two histidine residues, which 
play criticai roles in binding iron and oxygen. The outside of myoglobin, on 
the other hand, consists of both polar and nonpolar residues. The space- 
filling model shows that there is very little empty space inside. 

This contrasting distribution of polar and nonpolar residues reveals a 
key facet of protein architecture. In an aqueous environment, protein fold¬ 
ing is driven by the strong tendency of hydrophobic residues to be excluded 
from water (see Section 1.3.3). Recall that a system is more thermodynam- 
ically stable when hydrophobic groups are clustered rather than extended 
into the aqueous surroundings. The polypeptide chain therefore folds so that 
its hydrophobic side chains are buried and its polar, charged chains are on the 
surface. Many a helices and (3 strands are amphipathic; that is, the a helix 
or (3 strand has a hydrophobic face, which points into the protein interior, 
and a more polar face, which points into solution. The fate of the main chain 
accompanying the hydrophobic side chains is important, too. An unpaired 
peptide NH or CO group markedly prefers water to a nonpolar milieu. The 
secret of burying a segment of main chain in a hydrophobic environment is 
pairing all the NH and CO groups by hydrogen bonding. This pairing is 
neatly accomplished in an a helix or |3 sheet. Van der Waals interactions be¬ 
tween tightly packed hydrocarbon side chains also contribute to the stabil- 
ity of proteins. We can now understand why the set of 20 amino acids con- 
tains several that differ subtly in size and shape. They provide a palette from 
which to choose to fill the interior of a protein neatly and thereby maximize 
van der Waals interactions, which require intimate contact. 

Some proteins that span biological membranes are ‘‘the exceptions that 
prove the rule'' regarding the distribution of hydrophobic and hydrophilic 
amino acids throughout three- dimensional structures. For example, con- 
sider porins, proteins found in the outer membranes of many bactéria 
(Figure 3.46). The permeability barriers of membranes are built largely of 
alkane chains that are quite hydrophobic (Section 12.2.1). Thus, porins are 





Quaternary Structure 
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hydrophilic channel 
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Largely hydrophobic 
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FIGURE 3.46 "Insíde out" 
amino acid distríbution ín porín. 

The outside of porin (which contacts 
hydrophobic groups in membranes) is 
covered largely with hydrophobic residues, 
whereas the center includes a water-filled 
channel lined with charged and polar 
amino acids. 


covered on the outside largely with hydrophobic residues that interact with 
the neighboring alkane chains. In contrast, the center of the protein con- 
tains many charged and polar amino acids that surround a water-filled 
channel running through the middle of the protein. Thus, because porins 
function in hydrophobic environments, they are ‘hnside out'' relative to pro- 
teins that function in aqueous solution. 

Some polypeptide chains fold into two or more compact regions that may 
be connected by a flexible segment of polypeptide chain, rather like pearls 
on a string. These compact globular units, called domains, range in size from 
about 30 to 400 amino acid residues. For example, the extracellular part of 
CD4, the cell-surface protein on certain cells of the immune system to which 
the human immunodeficiency virus (HIV) attaches itself, comprises four 
similar domains of approximately 100 amino acids each (Figure 3.47). Of- 
ten, proteins are found to have domains in common even if their overall ter- 
tiary structures are different. 



FIGURE 3.47 Protein domains. 

The cell-surface protein CD4 consists of 
four similar domains. 


3.5 QUATERNARY STRUCTURE; POLYPEPTIDE CHAINS 
CAN ASSEMBLE INTO MULTISUBUNIT STRUCTURES 

Four leveis of structure are frequently cited in discussions of protein archi- 
tecture. So far, we have considered three of them. Primary structure is the 
amino acid sequence. Secondary structure refers to the spatial arrangement 
of amino acid residues that are nearby in the sequence. Some of these 
arrangements are of a regular kind, giving rise to a periodic structure. The 
a helix and |3 strand are elements of secondary structure. Tertiary structure 





protein of bacteriophage A is a dimer of identical subunits. hemoglobín. The structure of the two identical a subunits (red) is 

similar to but not identical with that of the two identical p subunits 
(yellow). The molecule contains four heme groups (black with the 
iron atom shown in purple). 



FIGURE 3.50 Complex quaternary 
structure. The coat of rhinovirus 
comprises 60 copies of each of four 
subunits. (A) A schematic view depicting 
the three types of subunits (shown in red, 
blue, and green) visible from outside the 
virus. (B) An electron micrograph showing 
rhinovirus particles. [Courtesy of Norm Olson, 
Dept. of Biological Sciences, Purdue University.] 


refers to the spatial arrangement of amino acid residues that are far apart in 
the sequence and to the pattern of disulfide bonds. We now turn to proteins 
containing more than one polypeptide chain. Such proteins exhibit a fourth 
levei of structural organization. Each polypeptide chain in such a protein is 
called a subunit. Quaternary structure refers to the spatial arrangement of 
subunits and the nature of their interactions. The simplest sort of quater¬ 
nary structure is a dimer, consisting of two identical subunits. This organ¬ 
ization is present in the DNA-binding protein Cro found in a bacterial virus 
called A (Figure 3.48). More complicated quaternary structures also are 
common. More than one type of subunit can be present, often in variable 
numbers. For example, human hemoglobin, the oxygen-carrying protein in 
blood, consists of two subunits of one type (designated a) and two subunits 
of another type (designated |3), as illustrated in Figure 3.49. Thus, the he¬ 
moglobin molecule exists as an a 2^2 tetramer. Subtle changes in the arrange¬ 
ment of subunits within the hemoglobin molecule allow it to carry oxygen 
from the lungs to tissues with great efficiency (Section 10.4.2). 

Viruses make the most of a limited amount of genetic information by 
forming coats that use the same kind of subunit repetitively in a symmet- 
ric array. The coat of rhinovirus, the virus that causes the common cold, in- 
cludes 60 copies each of four subunits (Figure 3.50). The subunits come to- 
gether to form a nearly spherical shell that encloses the virai genome. 


3.6 THE AIVIINO ACID SEQUENCE OF A PROTEIN 
DETERMINES ITS THREE-DIMENSIONAL STRUCTURE 

How is the elaborate three-dimensional structure of proteins attained, and 
how is the three-dimensional structure related to the one-dimensional 
amino acid sequence information? The classic work of Christian Anfinsen 
in the 1950s on the enzyme ribonuclease revealed the relation between the 
amino acid sequence of a protein and its conformation. Ribonuclease is a 
single polypeptide chain consisting of 124 amino acid residues cross-linked 
by four disulfide bonds (Figure 3.51). Anfinsen^s plan was to destroy the 











FIGURE 3.51 Amino acid sequence of FIGURE 3.52 Role of p-mercaptoethanol in redudng dísuifíde bonds. Note that, 

bovíne ríbonuclease. The four disulfide bonds as the disulfides are reduced, the p-mercaptoethanol is oxidized and forms dimers. 

are shown in color. [After C. H. W. Hirs, S. Moore, 
and W. H. Stein, 7. Biol. Chem. 235 (1960):633.] 


three- dimensional structure of the enzyme and to then determine what con- 
ditions were required to restore the structure. 

Agents such as urea or guanidinium chloride effectively disrupt the non- 
covalent bonds, although the mechanism of action of these agents is not 
fully understood. The disulfide bonds can be cleaved reversibly by reduc- 
ing them with a reagent such dis ^-mercaptoethanol (Figure 3.52). In the pres- 
ence of a large excess of |3-mercaptoethanol, a protein is produced in which 
the disulfides (cystines) are fully converted into sulfhydryls (cysteines). 

Most polypeptide chains devoid of cross-links assume a random-coil con- 
formation in 8 M urea or 6 M guanidinium chloride, as evidenced by phys- 
ical properties such as viscosity and optical activity. When ribonuclease was 
treated with |3-mercaptoethanol in 8 M urea, the product was a fully re¬ 
duced, randomly coiled polypeptide chain devoid of enzymatic activity. In 
other words, ribonuclease was denatured by this treatment (Figure 3.53). 

Anfinsen then made the criticai observation that the denatured ribonu¬ 
clease, freed of urea and |3-mercaptoethanol by dialysis, slowly regained en¬ 
zymatic activity. He immediately perceived the significance of this chance 
finding: the sulfhydryl groups of the denatured enzyme became oxidized 
by air, and the enzyme spontaneously refolded into a catalytically active 
form. Detailed studies then showed that nearly all the original enzymatic 
activity was regained if the sulfhydryl groups were oxidized under suitable 
conditions. All the measured physical and chemical properties of the re¬ 
folded enzyme were virtually identical with those of the native enzyme. 
These experiments showed that the information needed to specify the cat¬ 
alytically active structure of ribonuclease is contained in its amino acid se¬ 
quence. Subsequent studies have established the generality of this central 
principie of biochemistry: sequence specifies conformation. The dependence 
of conformation on sequence is especially significant because of the inti- 
mate connection between conformation and function. 
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FIGURE 3.53 Reduction and 
denaturatíon of ríbonuclease. 
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FIGURE 3.54 Reestablishing correct 
dísuifíde paíríng. Native ribonuclease 
can be reformed from scrambled 
ribonuclease in the presence of a trace of 
p-mercaptoethanol. 


A quite different result was obtained when reduced ribonuclease was re- 
oxidized while it was still in 8 M urea and the preparation was then dialyzed 
to remove the urea. Ribonuclease reoxidized in this way had only 1% of the 
enzymatic activity of the native protein. Why were the outcomes so different 
when reduced ribonuclease was reoxidized in the presence and absence of urea? 
The reason is that the wrong disulfides formed pairs in urea. There are 105 
different ways of pairing eight cysteine molecules to form four disulfides; only 
one of these combinations is enzymatically active. The 104 wrong pairings have 
been picturesquely termed ‘‘scrambled'' ribonuclease. Anfinsen found that 
scrambled ribonuclease spontaneously converted into fully active, native ri¬ 
bonuclease when trace amounts of P-mercaptoethanol were added to an aque- 
ous solution of the protein (Figure 3.54). The added |3-mercaptoethanol cat- 
alyzed the rearrangement of disulfide pairings until the native structure was 
regained in about 10 hours. This process was driven by the decrease infree en- 
ergy as the scrambled conformations were converted into the stable, native con- 
formation of the enzyme. The native disulfide pairings of ribonuclease thus con- 
tribute to the stabilization of the thermodynamically preferred structure. 

Similar refolding experiments have been performed on many other pro- 
teins. In many cases, the native structure can be generated under suitable 
conditions. For other proteins, however, refolding does not proceed effi- 
ciently. In these cases, the unfolding protein molecules usually become tan- 
gled up with one another to form aggregates. Inside cells, proteins called 
chaperones block such illicit interactions (Sections 11.36 and 29.x.x). 

3.6.1 Amino Acids Have Different Propensities for Forming 
Alpha Helices, Beta Sheets, and Beta Turns 

How does the amino acid sequence of a protein specify its three- dimensional 
structure? How does an unfolded polypeptide chain acquire the form of the 
native protein? These fundamental questions in biochemistry can be ap- 
proached by first asking a simpler one: What determines whether a particu¬ 
lar sequence in a protein forms an a helix, a (3 strand, or a turn? Examining 
the frequency of occurrence of particular amino acid residues in these sec- 
ondary structures (Table 3.3) can be a source of insight into this determina- 
tion. Residues such as alanine, glutamate, and leucine tend to be present in a 
helices, whereas valine and isoleucine tend to be present in |3 strands. Glycine, 
asparagine, and proline have a propensity for being in turns. 

The results of studies of proteins and synthetic peptides have revealed 
some reasons for these preferences. The a helix can be regarded as the de- 
fault conformation. Branching at the |3-carbon atom, as in valine, threonine, 
and isoleucine, tends to destabilize a helices because of steric clashes. These 
residues are readily accommodated in |3 strands, in which their side chains 
project out of the plane containing the main chain. Serine, aspartate, and 
asparagine tend to disrupt a helices because their side chains contain 
hydrogen-bond donors or acceptors in dose proximity to the main chain, 
where they compete for main-chain NH and CO groups. Proline tends to 
disrupt both a helices and |3 strands because it lacks an NH group and be¬ 
cause its ring structure restricts its ^ value to near —60 degrees. Glycine 
readily fits into all structures and for that reason does not favor helix for- 
mation in particular. 

Can one predict the secondary structure of proteins by using this knowl- 
edge of the conformational preferences of amino acid residues? Predictions 
of secondary structure adopted by a stretch of six or fewer residues have 
proved to be about 60 to 70% accurate. What stands in the way of more ac- 
curate prediction? Note that the conformational preferences of amino acid 
residues are not tipped all the way to one structure (see Table 3.3). For 
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TABLE 3.3 Relative frequencies of amino acid residues in secondary structures 


Amino acid 

a helix 

P sheet 

Turn 

Ala 

1.29 

0.90 

0.78 

Cys 

1.11 

0.74 

0.80 

Leu 

1.30 

1.02 

0.59 

Met 

1.47 

0.97 

0.39 

Glu 

1.44 

0.75 

1.00 

Gin 

1.27 

0.80 

0.97 

His 

1.22 

1.08 

0.69 

Lys 

1.23 

0.77 

0.96 

Vai 

0.91 

1.49 

0.47 

Ile 

0.97 

1.45 

0.51 

Phe 

1.07 

1.32 

0.58 

Tyr 

0.72 

1.25 

1.05 

Trp 

0.99 

1.14 

0.75 

Thr 

0.82 

1.21 

1.03 

Gly 

0.56 

0.92 

1.64 

Ser 

0.82 

0.95 

1.33 

Asp 

1.04 

0.72 

1.41 

Asn 

0.90 

0.76 

1.28 

Pro 

0.52 

0.64 

1.91 

Arg 

0.96 

0.99 

0.88 


Note: The amino acids are grouped according to their preference for a helices (top 
group), p sheets (second group), or turns (third group). Arginine shows no significant 
preference for any of the structures. 

After T. E. Creighton, Proteins: Structures and Molecular Properties, 2d ed. 

(W. H. Freeman and Company, 1992), p. 256. 
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example, glutamate, one of the strongest helix formers, 
prefers a helix to |3 strand by only a factor of two. The 
preference ratios of most other residues are smaller. In- 
deed, some penta- and hexapeptide sequences have been 
found to adopt one structure in one protein and an en- 
tirely different structure in another (Figure 3.55). 
Hence, some amino acid sequences do not uniquely de¬ 
termine secondary structure. Tertiary interactions—in- 
teractions between residues that are far apart in the se- 
quence—may be decisive in specifying the secondary 
structure of some segments. The context is often crucial 
in determining the conformational outcome. The confor- 
mation of a protein evolved to work in a particular en- 
vironment or context. 


Pathological conditions can result if a protein assumes an inappro- 
priate conformation for the context. Striking examples areprion dis- 
eases, such as Creutzfeldt-Jacob disease, kuru, and mad cow disease. These 
conditions result when a brain protein called a prion converts from its nor¬ 
mal conformation (designated PrP^) to an altered one (PrP^^). This con- 
version is self-propagating, leading to large aggregates of PrP^^. The role 
of these aggregates in the generation of the pathological conditions is not 
yet understood. 




FIGURE 3.55 Alternatíve 
conformatíons of a peptíde sequence. 

Many sequences can adopt alternatíve 
conformatíons ín different proteins. Here 
the sequence VDLLKN shown in red 
assumes an a helix in one protein 
context (left) and a p strand in another 
(right). 












FIGURE 3.56 Transítíon from folded to 
unfolded State. Most proteins show a 
Sharp transition from the folded to 
unfolded form on treatment with 
increasing concentrations of denaturants. 



3.6.2 Protein Folding Is a Highly Cooperative Process 

As stated earlier, proteins can be denatured by heat or by Chemical denat¬ 
urants such as urea or guanidium chloride. For many proteins, a compari- 
son of the degree of unfolding as the concentration of denaturant increases 
has revealed a relatively sharp transition from the folded, or native, form to 
the unfolded, or denatured, form, suggesting that only these two confor- 
mational States are present to any significant extent (Figure 3.56). A simi¬ 
lar sharp transition is observed if one starts with unfolded proteins and re¬ 
moves the denaturants, allowing the proteins to fold. 

Protein folding and unfolding is thus largely an ''all or none” process that 
results from a cooperative transition. For example, suppose that a protein is 
placed in conditions under which some part of the protein structure is ther- 
modynamically unstable. As this part of the folded structure is disrupted, 
the interactions between it and the remainder of the protein will be lost. The 
loss of these interactions, in turn, will destabilize the remainder of the struc¬ 
ture. Thus, conditions that lead to the disruption of any part of a protein 
structure are likely to unravel the protein completely. The structural prop- 
erties of proteins provide a clear rationale for the cooper¬ 
ative transition. 

The consequences of cooperative folding can be illus- 
trated by considering the contents of a protein solution 
under conditions corresponding to the middle of the tran¬ 
sition between the folded and unfolded forms. Under 
these conditions, the protein is ‘Talf folded.'' Yet the so¬ 
lution will contain no half-folded molecules but, instead, 
will be a 50/50 mixture of fully folded and fully unfolded 
molecules (Figure 3.57). Structures that are partly intact 
and partly disrupted are not thermodynamically stable 
and exist only transiently. Cooperative folding ensures 
that partly folded structures that might interfere with 
processes within cells do not accumulate. 


FIGURE 3.57 Components of a 
partíally denatured protein solution. 

In a h a If-unfolded protein solution, half 
the molecules are fully folded and half 
are fully unfolded. 


3.6.3 Proteins Fold by Progressive Stabilization 
of Intermediates Rather Than by Random Search 

The cooperative folding of proteins is a thermodynamic property; its oc- 
currence reveals nothing about the kinetics and mechanism of protein fold¬ 
ing. How does a protein make the transition from a diverse ensemble of un¬ 
folded structures into a unique conformation in the native form? One 
possibility a priori would be that all possible conformations are tried out to 
find the energetically most favorable one. How long would such a random 
search take? Consider a small protein with 100 residues. Cyrus Levinthal 
calculated that, if each residue can assume three different conformations, 
the total number of structures would be 3^^^, which is equal to 5X10^^. If 
it takes 10~^^ s to convert one structure into another, the total search time 
would be 5 X 10"^^ X 10~^^ s, which is equal to 5 X 10^"^ s, or 1.6 X 10^^ 
years. Clearly, it would take much too long for even a small protein to fold 
properly by randomly trying out all possible conformations. The enormous 
difference between calculated and actual folding times is called LevinthaVs 
Paradox. 

The way out of this dilemma is to recognize the power of cumulative se- 
lection. Richard Dawkins, in The Blind Watchmaker, asked how long it 
would take a monkey poking randomly at a typewriter to reproduce Ham- 
let's remark to Polonius, ‘‘Methinks it is like a weasel" (Figure 3.58). An as- 
tronomically large number of keystrokes, of the order of lO"^^, would be re- 
quired. However, suppose that we preserved each correct character and 












allowed the monkey to retype only the wrong ones. In this case, only a few 
thousand keystrokes, on average, would be needed. The crucial difference 
between these cases is that the first employs a completely random search, 
whereas, in the second, partly correct intermediates are retained. 

The essence of protein folding is the retention of partly correct intermedi¬ 
ates. However, the protein-folding problem is much more difficult than the 
one presented to our simian Shakespeare. First, the criterion of correctness 
is not a residue-by-residue scrutiny of conformation by an omniscient 
observer but rather the total free energy of the transient species. Second, 
proteins are only marginally stable. The free-energy difference between 
the folded and the unfolded States of a typical 100-residue protein is 10 kcal 
mol~^ (42 kj mol~^), and thus each residue contributes on average only 
0.1 kcal moP^ (0.42 kJ moP^) of energy to maintain the folded state. 
This amount is less than that of thermal energy, which is 0.6 kcal moP^ 
(2.5 kJ moP ^) at room temperature. This meager stabilization energy means 
that correct intermediates, especially those formed early in folding, can be 
lost. The analogy is that the monkey would be somewhat free to undo its 
correct keystrokes. Nonetheless, the interactions that lead to cooperative 
folding can stabilize intermediates as structure builds up. Thus, local re- 
gions, which have significant structural preference, though not necessarily 
stable on their own, will tend to adopt their favored structures and, as they 
form, can interact with one other, leading to increasing stabilization. 

3.6.4 Prediction of Three-Dimensional Structure 
from Sequence Remains a Great Challenge 

The amino acid sequence completely determines the three-dimensional 
structure of a protein. However, the prediction of three-dimensional struc¬ 
ture from sequence has proved to be extremely difficult. As we have seen, 
the local sequence appears to determine only between 60% and 70% of the 
secondary structure; long-range interactions are required to fix the full sec- 
ondary structure and the tertiary structure. 

Investigators are exploring two fundamentally different approaches to 
predicting three-dimensional structure from amino acid sequence. The first 
is ab initio prediction, which attempts to predict the folding of an amino acid 
sequence without any direct reference to other known protein structures. 
Computer-based calculations are employed that attempt to minimize the 
free energy of a structure with a given amino acid sequence or to simulate 
the folding process. The utility of these methods is limited by the vast num- 
ber of possible conformations, the marginal stability of proteins, and the 
subtle energetics of weak interactions in aqueous solution. The second ap- 
proach takes advantage of our growing knowledge of the three-dimensional 
structures of many proteins. In these knowledge-based methods, an amino 
acid sequence of unknown structure is examined for compatibility with any 
known protein structures. If a significant match is detected, the known 
structure can be used as an initial model. Knowledge-based methods have 
been a source of many insights into the three-dimensional conformation of 
proteins of known sequence but unknown structure. 

3.6.5 Protein Modification and Cleavage 
Confer New Capabilities 

T Proteins are able to perform numerous functions relying solely on 
the versatility of their 20 amino acids. However, many proteins are 
covalently modifed, through the attachment of groups other than amino 
acids, to augment their functions (Figure 3.59). For example, acetyl groups 
are attached to the amino termini of many proteins, a modification that 
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2431 Methinks it is like a weasel2 

FIGURE 3.58 Typíng monkey anology. 

A monkey randomiy poking a typewriter 
could write a line from Shakespeare's 
Hamiet, provided that correct keystrokes 
were retained. In the two Computer 
simulations shown, the cumulative 
number of keystrokes is given at the left 
of each line. 





































HOH2Q 


Q 


2- 


CHAPTER 3 • Protein Structure and Function 



-ooc 


\ 

CH 


coo- 




Carbohydrate-asparagine 

adduct 



FIGURE 3.59 Finíshíng touches. 

Some common and important covalent makes these proteins more resistant to degradation. The addition of hy- 

modifications of amino acid side chains droxyl groups to many proline residues stabilizes fibers of newly synthesized 

are shown. collagen, a fibrous protein found in connective tissue and bone. The bio- 

logical significance of this modification is evident in the disease scurvy: a 
deficiency of vitamin C results in insufficient hydroxylation of collagen and 
the abnormal collagen fibers that result are unable to maintain normal tis¬ 
sue strength. Another specialized amino acid produced by a finishing touch 
is y-carboxyglutamate. In vitamin K deficiency, insufficient carboxylation 
of glutamate in prothrombin, a clotting protein, can lead to hemorrhage. 
Many proteins, especially those that are present on the surfaces of cells or 
are secreted, acquire carbohydrate units on specific asparagine residues. The 
addition of sugars makes the proteins more hydrophilic and able to partic- 
ipate in interactions with other proteins. Conversely, the addition of difatty 
acid to an a-amino group or a cysteine sulfhydryl group produces a more 
hydrophobic protein. 

Many hormones, such as epinephrine (adrenaline), alter the activities of 
enzymes by stimulating the phosphorylation of the hydroxyl amino acids 
serine and threonine; phosphoserine and phosphothreonine are the most ubiq- 
uitous modified amino acids in proteins. Growth factors such as insulin act 
by triggering the phosphorylation of the hydroxyl group of tyrosine residues 
to form phosphotyrosine. The phosphoryl groups on these three modified 
amino acids are readily removed; thus they are able to act as reversible 
switches in regulating cellular processes. The roles of phosphorylation in 
signal transduction will be discussed extensively in Chapter 15. 

The preceding modifications consist of the addition of special groups to 
amino acids. Other special groups are generated by chemical rearrange- 
ments of side chains and, sometimes, the peptide backbone. For example, 
certain jellyfish produce a fluorescent green protein (Figure 3.60). The 
source of the fluorescence is a group formed by the spontaneous rearrange- 
ment and oxidation of the sequence Ser-Tyr-Gly within the center of the 
protein. This protein is of great utility to researchers as a marker within cells 
(Section 4.3.5). 

Finally, many proteins are cleaved and trimmed after synthesis. For ex¬ 
ample, digestive enzymes are synthesized as inactive precursors that can be 
stored safely in the pancreas. After release into the intestine, these precur¬ 
sors become activated by peptide-bond cleavage. In blood clotting, peptide- 
bond cleavage converts soluble fibrinogen into insoluble fibrin. A number 
of polypeptide hormones, such as adrenocorticotropic hormone, arise from 
the splitting of a single large precursor protein. Likewise, many virus pro¬ 
teins are produced by the cleavage of large polyprotein precursors. We shall 
encounter many more examples of modification and cleavage as essential 
features of protein formation and function. Indeed, these finishing touches 
account for much of the versatility, precision, and elegance of protein action 
and regulation. 




SUMMARY 


^ > Proteins are the workhorses of biochemistry, participating in essentially 
all cellular processes. Protein structure can be described at four leveis. 
The primary structure refers to the amino acid sequence. The second- 
ary structure refers to the conformation adopted by local regions of the 
polypeptide chain. Tertiary structure describes the overall folding of the 
polypeptide chain. Finally, quaternary structure refers to the specific 
association of multiple polypeptide chains to form multisubunit 
complexes. 

o Proteins Are Built from a Repertoire of 20 Amino Acids 

Proteins are linear polymers of amino acids. Each amino acid consists 
of a central tetrahedral carbon atom linked to an amino group, a car- 
boxylic acid group, a distinctive side chain, and a hydrogen. These tetra¬ 
hedral centers, with the exception of that of glycine, are chiral; only the 
L isomer exists in natural proteins. All natural proteins are constructed 
from the same set of 20 amino acids. The side chains of these 20 build- 
ing blocks vary tremendously in size, shape, and the presence of func- 
tional groups. They can be grouped as follows: (1) aliphatic side 
chains—glycine, alanine, valine, leucine, isoleucine, methionine, and 
proline; (2) aromatic side chains—phenylalanine, tyrosine, and trypto- 
phan; (3) hydroxyl-containing aliphatic side chains—serine and threo- 
nine; (4) sulfhydryl-containing cysteine; (5) basic side chains—lysine, 
arginine, and histidine; (6) acidic side chains—aspartic acid and glu- 
tamic acid; and (7) carboxamide-containing side chains—asparagine 
and glutamine. These groupings are somewhat arbitrary and many other 
sensible groupings are possible. 


(B) 



^ FIGURE 3.60 Chemical 
rearrangement in GFP. (A) The 

structure of green fluorescent protein (GFP). 
The rearrangement and oxidation of the 
sequence Ser-Try-Gly is the source of 
fluorescence. (B) Fluorescence micrograph 
of a four-cell embryo (cells are outiined) 
from the roundworm C elegans containing 
a protein, PIE-1, labeled with GFP. The 
protein is expressed only in the cell (top) 
that wili give rise to the germiine. 

[(B) Courtesy of Geraldine Seydoux.] 
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The amino acids in a polypeptide are linked by amide bonds formed be- 
tween the carboxyl group of one amino acid and the amino group of the 
next. This linkage, called a peptide bond, has several important prop- 
erties. First, it is resistant to hydrolysis so that proteins are remarkably 
stable kinetically. Second, the peptide group is planar because the C-N 
bond has considerable double-bond character. Third, each peptide bond 
has both a hydrogen-bond donor (the NH group) and a hydrogen-bond 
acceptor (the CO group). Hydrogen bonding between these backbone 
groups is a distinctive feature of protein structure. Finally, the peptide 
bond is uncharged, which alio ws proteins to form tightly packed glob¬ 
ular structures having significant amounts of the backbone buried 
within the protein interior. Because they are linear polymers, proteins 
can be described as sequences of amino acids. Such sequences are writ- 
ten from the amino to the carboxyl terminus. 

• Secondary Structure: Polypeptide Chains Can Fold into Regular Structures 
Such as the Alpha Helix, the Beta Sheet, and Turns and Loops 

Two major elements of secondary structure are the a helix and the |3 
strand. In the a helix, the polypeptide chain twists into a tightly packed 
rod. Within the helix, the CO group of each amino acid is hydrogen 
bonded to the NH group of the amino acid four residues along the 
polypeptide chain. In the |3 strand, the polypeptide chain is nearly fully 
extended. Two or more |3 strands connected by NH-to-CO hydrogen 
bonds come together to form P sheets. 

• Tertiary Structure: Water-Soluble Proteins Fold into Compact Structures 
with Nonpolar Cores 

The compact, asymmetric structure that individual polypeptides attain 
is called tertiary structure. The tertiary structures of water-soluble pro¬ 
teins have features in common: (1) an interior formed of amino acids with 
hydrophobic side chains and (2) a surface formed largely of hydrophilic 
amino acids that interact with the aqueous environment. The driving 
force for the formation of the tertiary structure of water-soluble proteins 
is the hydrophobic interactions between the interior residues. Some pro¬ 
teins that exist in a hydrophobic environment, in membranes, display 
the inverse distribution of hydrophobic and hydrophilic amino acids. In 
these proteins, the hydrophobic amino acids are on the surface to inter¬ 
act with the environment, whereas the hydrophilic groups are shielded 
from the environment in the interior of the protein. 

• Quaternary Structure: Polypeptide Chains Can Assemble 
into Multisubunit Structures 

Proteins consisting of more than one polypeptide chain display quater¬ 
nary structure, and each individual polypeptide chain is called a sub- 
unit. Quaternary structure can be as simple as two identical subunits or 
as complex as dozens of different subunits. In most cases, the subunits 
are held together by noncovalent bonds. 

• The Amino Acid Sequence of a Protein Determines Its 
Three-Dimensional Structure 

The amino acid sequence completely determines the three-dimensional 
structure and, hence, all other properties of a protein. Some proteins can 
be unfolded completely yet refold efficiently when placed under condi- 
tions in which the folded form of the protein is stable. The amino acid 
sequence of a protein is determined by the sequences of bases in a DNA 
molecule. This one-dimensional sequence Information is extended into 
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neously. Protein folding is a highly cooperativo process; structural in- Appendix: Acid-Base Concepts 

termediates between the unfolded and folded forms do not accumulate. 

The versatility of proteins is further enhanced by covalent modifi- 
cations. Such modifications can incorporate functional groups not pres- 
ent in the 20 amino acids. Other modifications are important to the reg- 
ulation of protein activity. Through their structural stability, diversity, 
and Chemical reactivity, proteins make possible most of the key 
processes associated with life. 


APPENDIX; ACID-BASE CONCEPTS 


lonization of Water 

Water dissociates into hydronium (HaO^) and hydroxyl (OH~) 
ions. For simplicity, we refer to the hydronium ion as a hydro- 
gen ion (H^) and write the equilibrium as 

H20^^H+ + 0H“ 

The equilibrium constant K^q of this dissociation is given by 

i^eq=[H^][0H-]/[H20] (1) 

in which the terms in brackets denote molar concentrations. Be- 
cause the concentration of water (55.5 M) is changed little by 
ionization, expression 1 can be simplified to give 

Kw=[H^][OH-] (2) 

in which is the ion product of water. At 25°C, is 1.0 X 

Note that the concentrations of and OH~ are recipro- 
cally related. If the concentration of is high, then the con¬ 
centration of OH~ must be low, and vice versa. For example, if 
[H+] = 10“2 M, then [OH“] = 10 “^^ 

Definition of Add and Base 

An acid is a proton donor. A base is a proton acceptor. 

Acid ^ + base 

CH 3 COOH H+ + CHaCOO” 

Acetic acid Acetate 

NH4'" + NH3 

Ammonium ion Ammonia 

The species formed by the ionization of an acid is its conjugate 
base. Conversely, protonation of a base yields its conjugate acid. 
Acetic acid and acetate ion are a conjugate acid-base pair. 

Definition of pH and plí 

The pH of a solution is a measure of its concentration of H^. 
The pH is defined as 

pH = logio(l/[H^])=-logio[H^] (3) 

The ionization equilibrium of a weak acid is given by 
HA H+ + A“ 

The apparent equilibrium constant for this ionization is 

i^a=[H^][A-]/[HA] (4) 


The pfCa of an acid is defined as 

pfCa = -log iCa = log(l/Ka) (5) 

Inspection of equation 4 shows that the pKa of an acid is the pH 
at which it is half dissociated, when [A~] = [HA]. 

Henderson-Hasseibalch Equation 

What is the relation between pH and the ratio of acid to base? 
A useful expression can be derived from equation 4. Rearrange- 
ment of that equation gives 

l/[H+] = l/Ra[A-]/[HA] (6) 

Taking the logarithm of both sides of equation 6 gives 

log(l/[H^]) = log(l/Ra) + log([A-]/[HA]) (7) 

Substituting pH for log 1 /[H^] and pfCa log l/Kg^in equa¬ 
tion 7 yields 

pH = pRa + log([A-]/[HA]) ( 8 ) 

which is commonly known as the Henderson-Hasseibalch 
equation. 

The pH of a solution can be calculated from equation 8 if 
the molar proportion of A~ to HA and the pR^ of HA are known. 
Consider a solution of 0.1 M acetic acid and 0.2 M acetate ion. 
The pRa of acetic acid is 4.8. Hence, the pH of the solution is 
given by 

pH = 4.8 + log( 0 . 2 / 0 . 1 ) = 4.8 + log 2.0 = 4.8 + 0.3 = 5.1 

Conversely, the pRa of an acid can be calculated if the molar pro¬ 
portion of A~ to HA and the pH of the solution are known. 

Buffers 

An acid-base conjugate pair (such as acetic acid and acetate ion) 
has an important property: it resists changes in the pH of a so¬ 
lution. In other words, it acts as a buffer. Consider the addition 
of OH~ to a solution of acetic acid (HA): 

HA + OH“^^ A“ + H20 

A plot of the dependence of the pH of this solution on the 
amount of OH~ added is called a titration curve (Figure 3.61). 
Note that there is an inflection point in the curve at pH 4.8, 
which is the pRa of acetic acid. In the vicinity of this pH, a rel- 
atively large amount of OH~ produces little change in pH. In 
other words, the buffer maintains the value of pH near a given 
value, despite the addition of other either protons or hydroxide 
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FIGURE 3.61 Titratíon curve of acetíc acíd. 

ions. In general, a weak acid is most effective in buffering against 
pH changes in the vicinity of its value. 

plía Values of Amino Acids 

An amino acid such as glycine contains two ionizable groups: 
an a-carboxyl group and a protonated a-amino group. As base 
is added, these two groups are titrated (Figure 3.62). The 
of the a-COOH group is 2.4, whereas that of the a-NH 3 ^ 
group is 9.8. The pK^ values of these groups in other amino 


r 


TABLE 3.4 pK^ values of some amino acids 


Amino acid 

pKa values (25 °C) 


a-COOH 

group 

a-NH3+ 

group 

Side 

chain 

Alanine 

2.3 

9.9 


Glycine 

2.4 

9.8 


Phenylalanine 

1.8 

9.1 


Serine 

2.1 

9.2 


Valine 

2.3 

9.6 


Aspartic acid 

2.0 

10.0 

3.9 

Glutamic acid 

2.2 

9.7 

4.3 

Histidine 

1.8 

9.2 

6.0 

Cysteine 

1.8 

10.8 

8.3 

Tyrosine 

2.2 

9.1 

10.9 

Lysine 

2.2 

9.2 

10.8 

Arginine 

1.8 

9.0 

12.5 


After J. T. Edsall and J. Wyman, Biophysical Chemistry 
(Academic Press, 1958), Chapter 8. 


acids are similar (Table 3.4). Some amino acids, such as aspar- 
tic acid, also contain an ionizable side chain. The pfCa values of 
ionizable side chains in amino acids range from 3.9 (aspartic 
acid) to 12.5 (arginine). 



|OH~| HOH 

H+ 



|OH~| HOH 

H+ 


H2N 



FIGURE 3.62 Titration of the a-carboxyl 
and a-amino groups of an amino acid. 
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side chain (R group) (p. 43) 

L amino acid (p. 43) 

dipolar ion (zwitterion) (p. 43) 

peptide bond (amide bond) (p. 51) 

disulfide bond (p. 52) 

primary structure (p. 53) 

phi (c|)) angle (p. 55) 


psi (i|;) angle (p. 55) tertiary structure (p. 62) 

Ramachandran diagram (p. 55) domain (p. 63) 

a helix (p. 56) subunit (p. 64) 

P pleated sheet (p. 58) quaternary structure (p. 64) 

P strand (p. 58) cooperative transition (p. 68) 

reverse turn (P turn; hairpin turn) (p. 60) 
secondary structure (p. 61) 
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-1 PROBLEMS I- 

1 . Shape and dimension. (a) Tropomyosin, a 70-kd muscle 
protein, is a two-stranded a-helical coiled coil. Estimate the 
length of the molecule? (b) Suppose that a 40-residue segment 
of a protein folds into a two-stranded antiparallel |3 structure 
with a 4-residue hairpin turn. What is the longest dimension 
of this motif? 

2. Contrasting isomers. Poly-L-leucine in an organic solvent 
such as dioxane is a helical, whereas poly-L-isoleucine is not. 
Why do these amino acids with the same number and kinds of 
atoms have different helix-forming tendencies? 

3. Active again. A mutation that changes an alanine residue in 
the interior of a protein to valine is found to lead to a loss of ac- 
tivity. However, activity is regained when a second mutation at 
a different position changes an isoleucine residue to glycine. 
How might this second mutation lead to a restoration of activ¬ 
ity? 

4. Shuffle test. An enzyme that catalyzes disulfide-sulfhydryl 
exchange reactions, called protein disulfide isomerase (PDI), has 
been isolated. PDI rapidly converts inactive scrambled ribonu- 
clease into enzymatically active ribonuclease. In contrast, insulin 
is rapidly inactivated by PDI. What does this important obser- 
vation imply about the relation between the amino acid sequence 
of insulin and its three-dimensional structure? 

5. Stretching a target. A protease is an enzyme that catalyzes 
the hydrolysis of the peptide bonds of target proteins. How 
might a protease bind a target protein so that its main chain be- 
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comes fully extended in the vicinity of the vulnerable peptide 
bond? 

6 . Often irreplaceable. Glycine is a highly conserved amino acid 
residue in the evolution of proteins. Why? 

7. Potential partners. Identify the groups in a protein that can 
form hydrogen bonds or electrostatic bonds with an arginine side 
chain at pH 7. 

8 . Permanent waves. The shape of hair is determined in part by 
the pattern of disulfide bonds in keratin, its major protein. How 
can curls be induced? 

9. Location is everything. Proteins that span biological mem- 
branes often contain a helices. Given that the insides of mem- 
branes are highly hydrophobic (Section 12.2.1), predict what 
type of amino acids would be in such a helix. Why is an a helix 
particularly suited to exist in the hydrophobic environment of 
the interior of a membrane? 

10 . Issues of stability. Proteins are quite stable. The lifetime of 
a peptide bond in aqueous solution is nearly 1000 years. How¬ 
ever, the AG°' of hydrolysis of proteins is negative and quite 
large. How can you account for the stability of the peptide bond 
in light of the fact that hydrolysis releases much energy? 

11. Minor species. For an amino acid such as alanine, the major 
species in solution at pH 7 is the zwitterionic form. Assume a 
pKa value of 8 for the amino group and a pK^ value of 3 for the 
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carboxylic acid and estimate the ratio of the concentration of 
neutral amino acid species (with the carboxylic acid protonated 
and the amino group neutral) to that of the zwittcrionic species 
at pH 7. 

12. A matter of convention. All L amino acids have an S absolute 
configuration except L-cysteine, which has the R configuration. 
Explain why L-cysteine is designated as the R absolute config¬ 
uration. 

13. Hidden message. Translate the foliowing amino acid se- 
quence into one-letter code: Ser-Ala-Val-Glu-Met-Glu-Ile-Met- 
Thr-Arg-Ala-Pro-Pro-Glu-Asp-Ile-Asn-Ala-Gly-Glu-Asn- 
Glu. Note the message. 

14. Who goes first? Would you expect Pro-X peptide bonds to 
tend to have cis conformations like those of X-Pro bonds? Why 
or why not? 

15. Matching, For each of the amino acid derivatives shown be- 
low (A-E), find the matching set of c|) and i|; values (a-e). 



16. Concentrate of the concentration. A solution of a protein 
whose sequence includes three tryptophan residues, no tyrosine 
residues, and no phenylalanine residues has an absorbance of 0.1 
at 280 nm in a cell with a path length of 1 cm. Estimate the con¬ 
centration of the protein in units of molarity. If the protein has 
a molecular mass of 100 kd, estimate the concentration in units 
of milligrams of protein per milliliter of solution. 

Media Problem 

You can use the Structural Insights and Conceptual Insights as 

visual aids to help you answer Media Problems. Go to the Website: 
www.whfreeman.conn/biochennS, and select the appiicable module. 

17. Inside-out, back-to-front. In the Media Problem section of 
the Structural Insights module on protein structure, you can 
examine molecular models of four putative protein structures. 
One of the four structures has been determined by x-ray crys- 
tallography. The other three have been made-up, and in fact are 
very unlikely to occur. Which are the structures that are unlikely 
to occur and why? 



(a) (b) (c) (d) (e) 

cl) = 120°, cl) = 180°, cl) = 180°, cl) = 0°, cl) = -60°, 

ili = 120° ili = 0° ili = 180° il;=180° ili=-40° 


Exploring Proteins 




Mass/charge 


Mílk, a source of nourishment for all mammals, is composed, in part, of a varíety 
of proteins. The protein components of milk are revealed by the technique of MALDI-TOF 
mass spectrometry, which separates molecules on the basis of their mass to charge ratio. 
[(Left) Jean Paul Iris/FPG (Right) courtesy of Brian Chait] 


OUTLINE 


In the preceding chapter, we saw that proteins play crucial roles in nearly all 
biological processes—in catalysis, signal transmission, and structural sup- 
port. This remarkable range of functions arises from the existence of thou- 
sands of proteins, each folded into a distinctive three-dimensional structure 
that enables it to interact with one or more of a highly 

diverse array of molecules. A major goal of biochemistry is _ 

to determine how amino acid sequences specify the confor- 
mations of proteins. Other goals are to learn how individ¬ 
ual proteins bind specific substrates and other molecules, 
mediate catalysis, and transduce energy and information. 

The purification of the protein of interest is the indis- 
pensable first step in a series of studies aimed at exploring 
protein function. Proteins can be separated from one an- 
other on the basis of solubility, size, charge, and binding 
ability. When a protein has been purified, the amino acid 
sequence can be determined. The strategy is to divide and 
conquer, to obtain specific fragments that can be readily se- 
quenced. Automated peptide sequencing and the applica- 
tion of recombinant DNA methods are providing a wealth 
of amino acid sequence data that are opening new vistas. To 
understand the physiological context of a protein, antibod- 
ies are choice probes for locating proteins in vivo and meas- 
uring their quantities. Monoclonal antibodies able to probe 
for specific proteins can be obtained in large amounts. The 
synthesis of peptides is possible, which makes feasible the 
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synthesis of new drugs, functional protein fragments, and antigens for in- 
ducing the formation of specific antibodies. Nuclear magnetic resonance 
(NMR) spectroscopy and x-ray crystallography are the principal techniques 
for elucidating three-dimensional structure, the key determinant of function. 

The exploration of proteins by this array of physical and chemical tech¬ 
niques has greatly enriched our understanding of the molecular basis of life 
and makes it possible to tackle some of the most challenging questions of 
biology in molecular terms. 

4.0.1 The Proteome Is the Functional Representation 
of the Genome 

Many organisms are yielding their DNA base sequences to gene sequencers, 
including several metazoans. The roundworm Caenorhabditis elegans has a 
genome of 97 million bases and about 19,000 protein-encoding genes, 
whereas that of the fruit fly Drosophilia melanogaster contains 180 million 
bases and about 14,000 genes. The incredible progress being made in gene 
sequencing has already culminated in the elucidation of the complete se- 
quence of the human genome, all 3 billion bases with an estimated 40,000 
genes. But this genomic knowledge is analogous to a list of parts for a car— 
it does not explain how the parts work together. A new word has been 
coined, the proteome, to signify a more complex levei of Information con- 
tent, the levei o{functional Information, which encompasses the type, func- 
tions, and interactions of proteins that yield a functional unit. 

The term proteome is derived from proteins expressed by the genome. 
Whereas the genome tells us what is possible, the proteome tells us what is 
functionally present—for example, which proteins interact to form a signal- 
transduction pathway or an ion channel in a membrane. The proteome is 
not a fixed characteristic of the cell. Rather, because it represents the func¬ 
tional expression of Information, it varies with cell type, developmental 
stage, and environmental conditions, such as the presence of hormones. The 
proteome is much larger than the genome because of such factors as alter- 
natively spliced RNA, the posttranslational modification of proteins, the 
temporal regulation of protein synthesis, and varying protein-protein in¬ 
teractions. Unlike the genome, the proteome is not static. 

An understanding of the proteome is acquired by investigating, charac- 
terizing, and cataloging proteins. An investigator often begins the process 
by separating a particular protein from all other biomolecules in the cell. 


4.1 THE PURIFICATION OF PROTEINS IS AN ESSENTIAL 
FIRST STEP IN UNDERSTANDING THEIR FUNCTION 

An adage of biochemistry is, Never waste pure thoughts on an impure pro¬ 
tein. Starting from pure proteins, we can determine amino acid sequences 
and evolutionary relationships between proteins in diverse organisms and 
we can investigate a protein s biochemical function. Moreover, crystals of 
the protein may be grown from pure protein, and from such crystals we can 
obtain x-ray data that will provide us with a picture of the protein s tertiary 
structure—the dictual functional unit. 

4.1.1 The Assay: How Do We Recognize the Protein That We 
Are Looking For? 

Purification should yield a sample of protein containing only one type of 
molecule, the protein in which the biochemist is interested. This protein 




sample may be only a fraction of 1% of the starting material, whether that 
starting material consists of cells in culture or a particular organ from a plant 
or animal. How is the biochemist able to isolate a particular protein from a 
complex mixture of proteins? 

The biochemist needs a test, called an assay, for some unique identify- 
ing property of the protein so that he or she can tell when the protein is pres- 
ent. Determining an effective assay is often difficult; but the more specific 
the assay, the more effective the purification. For enzymes, which are pro¬ 
tein catalysts (Chapter 8), the assay is usually based on the reaction that the 
enzyme catalyzes in the cell. Consider the enzyme lactate dehydrogenase, 
an important player in the anaerobic generation of energy from glucose as 
well as in the synthesis of glucose from lactate. Lactate dehydrogenase car- 
ries out the following reaction: 




HO- 


-H + NAD+ 


Lactate 

dehydrogenase 


+ NADH + H+ 


CH3 

Lactate 


O" XH3 

Pyruvate 


Nicotinamide adenine dinucleotide [reduced (NADH); Section 14.3.1] 
is distinguishable from the other components of the reaction by its ability 
to absorb light at 340 nm. Consequently, we can follow the progress of the 
reaction by examining how much light the reaction mixture absorbs at 
340 nm in unit time—for instance, within 1 minute after the addition of the 
enzyme. Our assay for enzyme activity during the purification of lactate 
dehydrogenase is thus the increase in absorbance of light at 340 nm observed 
in 1 minute. 

To be certain that our purification scheme is working, we need one ad- 
ditional piece of information —the amount of protein present in the mixture 
being assayed. There are various rapid and accurate means of determining 
protein concentration. With these two experimentally determined num- 
bers—enzyme activity and protein concentration—we then calculate the 
specific activity, the ratio of enzyme activity to the amount of protein in the 
enzyme assay. The specific activity will rise as the purification proceeds and 
the protein mixture being assayed consists to a greater and greater extent of 
lactate dehydrogenase. In essence, the point of the purification is to maxi¬ 
mize the specific activity. 
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4.1.2 Proteins Must Be Released from the Cell to Be Purified 

Having found an assay and chosen a source of protein, we must now frac- 
tionate the cell into components and determine which component is en- 
riched in the protein of interest. Such fractionation schemes are developed 
by trial and error, on the basis of previous experience. In the first step, a 
homogenate is formed by disrupting the cell membrane, and the mixture is 
fractionated by centrifugation, yielding a dense pellet of heavy material at 
the bottom of the centrifuge tube and a lighter supernatant above (Figure 
4.1). The supernatant is again centrifuged at a greater force to yield yet an- 
other pellet and supernatant. The procedure, called differential centrifuga¬ 
tion, yields several fractions of decreasing density, each still containing hun- 
dreds of different proteins, which are subsequently assayed for the activity 
being purified. Usually, one fraction will be enriched for such activity, and 
it then serves as the source of material to which more discriminating pu¬ 
rification techniques are applied. 
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Centrifuge 
at 500 X g 
for 10 nninutes 



Honnogenate 

fornns 


FIGURE 4.1 Differentíal centrifugation. 

Cells are disrupted in a honnogenizer and 
the resulting mixture, called the 
honnogenate, is centrifuged in a step-by- 
step fashion of increasing centrifugai force. 
The denser material wili form a pellet at 
lower centrifugai force than wilI the less- 
dense material. The isolated fractions can 
be used for further purification. [Photographs 
courtesy of S. Fleischer and B. Fleischer.] 
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4.1.3 Proteins Can Be Purified According to Solubility, 

Size, Charge, and Binding Affinity 

Several thousand proteins have been purified in active form on the basis of 
such characteristics as solubility, size, charge, and specific binding affinity. 
Usually, protein mixtures are subjected to a series of separations, each based 
on a different property to yield a pure protein. At each step in the purifica¬ 
tion, the preparation is assayed and the protein concentration is determined. 
Substantial quantities of purified proteins, of the order of many milligrams, 
are needed to fully elucidate their three-dimensional structures and their 
mechanisms of action. Thus, the overall yield is an important feature of a 
purification scheme. A variety of purification techniques are available. 

Salting Out. Most proteins are less soluble at high salt concentrations, 
an effect called salting out. The salt concentration at which a protein pre- 
cipitates differs from one protein to another. Hence, salting out can be used 
to fractionate proteins. For example, 0.8 M ammonium sulfate precipitates 
fibrinogen, a blood-clotting protein, whereas a concentration of 2.4 M is 
needed to precipitate serum albumin. Salting out is also useful for concen- 
trating dilute Solutions of proteins, including active fractions obtained from 
other purification steps. Dialysis can be used to remove the salt if necessary. 

Dialysis. Proteins can be separated from small molecules by dialysis 
through a semipermeable membrane, such as a cellulose membrane with 























































pores (Figure 4.2). Molecules having dimensions signif- 
icantly greater than the pore diameter are retained inside 
the dialysis bag, whereas smaller molecules and ions tra- 
verse the pores of such a membrane and emerge in the 
dialysate outside the bag. This technique is useful for re¬ 
mo ving a salt or other small molecule, but it will not dis- 
tinguish between proteins effectively. 

Gel-Filtration Chromatography. More discrimina- 
ting separations on the basis of size can be achieved by 
the technique of gel-filtration chromatography (Figure 4.3). 

The sample is applied to the top of a column consisting 
of porous beads made of an insoluble but highly hydrated 
polymer such as dextran or agarose (which are carbohy- 
drates) or polyacrylamide. Sephadex, Sepharose, and Bio- 
gel are commonly used commercial preparations of these 
beads, which are typically 100 fjim (0.1 mm) in diameter. 

Small molecules can enter these beads, but large ones can- 
not. The result is that small molecules are distributed in the aqueous solu- 
tion both inside the beads and between them, whereas large molecules are 
located only in the solution between the beads. Large molecules flow more 
rapidly through this column and emerge first because a smaller volume is acces- 
sible to them. Molecules that are of a size to occasionally enter a bead will 
flow from the column at an intermediate position, and small molecules, 
which take a longer, tortuous path, will exit last. 




FIGURE 4.2 Dialysis. Protein molecules 
(red) are retained within the dialysis bag, 
whereas small molecules (blue) diffuse 
into the surrounding médium. 
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FIGURE 4.3 Gel filtration chromatography. A mixture of proteins in a small volume is 
applied to a column filled with porous beads. Because large proteins cannot enter the 
internai volume of the beads, they emerge sooner than do small ones. 


lon-Exchange Chromatography. Proteins can be separated on the 
basis of their net charge by ion-exchange chromatography. If a protein has a 
net positive charge at pH 7, it will usually bind to a column of beads con- 
taining carboxylate groups, whereas a negatively charged protein will not 








































































Positively charged 
protein binds to 
negatively charged 
bead 


Negatively charged 
protein flows 
through 


FIGURE 4.4 lon-exchange 
chromatography. This technique 
separates proteins mainly according 
to their net charge. 


(Figure 4.4). A positively charged protein bound to such a column can then 
be eluted (released) by increasing the concentration of sodium chloride or 
another salt in the eluting buffer because sodium ions compete with posi¬ 
tively charged groups on the protein for binding to the column. Proteins 
that have a low density of net positive charge will tend to emerge first, 
followed by those having a higher charge density. Positively charged 
proteins (cationic proteins) can be separated on negatively charged 
carboxymethyl-cellulose (CM-cellulose) columns. Conversely, negatively 
charged proteins (anionic proteins) can be separated by chromatography on 
positively charged diethylaminoethyl-cellulose (DEAE—cellulose) columns. 



Carboxymethyl 
(CM) group 

(ionized form) 



(protonated form) 


Glucose-binding 
protein attaches 
to glucose 
residues (G) on 
beads 



Addition of 
glucose (G) 


Glucose-binding 
proteins are 
released on 
addition of 
glucose 



FIGURE 4.5 Affinity chromatography. 

Affinity chromatography of concanavalin 
A (shown in yellow) on a solid support 
containing covalentiy attached glucose 
residues (G). 


Affinity Chromatography. Affinity chromatography is another pow- 
erful and generally applicable means of purifying proteins. This technique 
takes advantage of the high affinity of many proteins for specific chemical 
groups. For example, the plant protein concanavalin A can be purified by 
passing a crude extract through a column of beads containing covalently at¬ 
tached glucose residues. Concanavalin A binds to such a column because it 
has affinity for glucose, whereas most other proteins do not. The bound con¬ 
canavalin A can then be released from the column by adding a concentrated 
solution of glucose. The glucose in solution displaces the column-attached 
glucose residues from binding sites on concanavalin A (Figure 4.5). Affin¬ 
ity chromatography is a powerful means of isolating transcription factors, 
proteins that regulate gene expression by binding to specific DNA se- 
quences. A protein mixture is percolated through a column containing spe¬ 
cific DNA sequences attached to a matrix; proteins with a high affinity for 
the sequence will bind and be retained. In this instance, the transcription 
factor is released by washing with a solution containing a high concentra¬ 
tion of salt. In general, affinity chromatography can be effectively used to 
isolate a protein that recognizes group X by (1) covalently attaching X or a 
derivative of it to a column, (2) adding a mixture of proteins to this column, 
which is then washed with buffer to remove unbound proteins, and (3) elut¬ 
ing the desired protein by adding a high concentration of a soluble form of 
X or altering the conditions to decrease binding affinity. Affinity chro¬ 
matography is most effective when the interaction of the protein and the 
molecule that is used as the bait is highly specific. 

High-Pressure Liquid Chromatography. The resolving power of all 
of the column techniques can be improved substantially through the use of 
a technique called high-pressure liquid chromatography (HPLC), which is an 
enhanced version of the column techniques already discussed. The column 
materiais themselves are much more finely divided and, as a consequence, 
there are more interaction sites and thus greater resolving power. Because 
the column is made of finer material, pressure must be applied to the col¬ 
umn to obtain adequate flow rates. The net result is high resolution as well 
as rapid separation (Figure 4.6). 
















4.1.4 Proteins Can Be Separated by Gel Electrophoresis 
and Displayed 

How can we tell whether a purification scheme is effective? One way is to 
ascertain that the specific activity rises with each purification step. Another 
is to visualize the effectiveness by displaying the proteins present at each 
step. The technique of electrophoresis makes the latter method possible. 


Gel Electrophoresis. A molecule with a net charge will move in an 
electric field. This phenomenon, termed electrophoresis, offers a powerful 
means of separating proteins and other macromolecules, such as DNA and 
RNA. The velocity of migration (v) of a protein (or any molecule) in an 
electric field depends on the electric field strength (E), the net charge on 
the protein (z), and the frictional coefficient (f). 


V = 


f 


( 1 ) 


The electric force Ez driving the charged molecule toward the oppositely 
charged electrode is opposed by the viscous drag fv arising from friction 
between the moving molecule and the médium. The frictional coefficient/ 
depends on both the mass and shape of the migrating molecule and the vis- 
cosity (rf) of the médium. For a sphere of radius r, 


-1 83 I - 

Protein Purification 



f— ÒTTTir (2) 

Electrophoretic separations are nearly always carried out in gels (or on solid 
supports such as paper) because the gel serves as a molecular sieve that en- 
hances separation (Figure 4.7). Molecules that are small compared with the 
pores in the gel readily move through the gel, whereas molecules much larger 
than the pores are almost immobile. Intermediate-size molecules move 
through the gel with various degrees of facility. Electrophoresis is performed 


FIGURE 4.6 Hígh-pressure líquid 
chromatography (HPLC). Gel filtration 
by HPLC clearly defines the individual 
proteins because of its greater resolving 
power: (1) thyroglobulin (669 kd), 

(2) catalase (232 kd), (3) bovine serum 
albumin (67 kd), (4) ovalbumin (43 kd), 
and (5) ribonuclease (13.4 kd). [After K. J. 
Wilson and T. D. Schiabach. In Current Protocols 
in Molecular Biology, vol. 2, suppi. 41, F. M. 
Ausbel, R. Brent, R. E. Kingston, D. D. Moore, 

J. G. Seidman, J. A. Smith, and K. Struhl, Eds. 
(Wiley, 1998), p. 10.14.1.] 



FIGURE 4.7 Polyacrylamíde gel electrophoresis. (A) Gel electrophoresis apparatus. 
Typically, several samples undergo electrophoresis on one flat polyacrylamide gel. A 
microliter pipette is used to place Solutions of proteins in the wells of the slab. A cover is 
then placed over the gel chamber and voltage is appiied. The negatively charged SDS 
(sodium dodecyl sulfate)-protein complexes migrate in the direction of the anode, at the 
bottom of the gel. (B) The sieving action of a porous polyacrylamide gel separates proteins 
according to size, with the smallest moving most rapidiy. 
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FIGURE 4.8 Formatíon of a 
polyacrylamide gel. A three-dimensíonal 
mesh is formed by co-polymerizing 
activated monomer (blue) and 
cross-linker (red). 
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in a thin, vertical slab of polyacrylamide. The direction of flow is from top 
to bottom. Polyacrylamide gels, formed by the polymerization of acrylamide 
and cross-linked by methylenebisacrylamide, are choice supporting media 
for electrophoresis because they are chemically inert and are readily formed 
(Figure 4.8). Electrophoresis is the opposite of gel filtration in that all of the 
molecules, regardless of size, are forced to move through the same matrix. 
The gel behaves as one bead of a gel-filtration column. 

Proteins can be separated largely on the basis of mass by electrophoresis 
in a polyacrylamide gel under denaturing conditions. The mixture of pro¬ 
teins is first dissolved in a solution of sodium dodecyl sulfate (SDS), an an- 
ionic detergent that disrupts nearly all noncovalent interactions in native 
proteins. Mercaptoethanol (2-thioethanol) or dithiothreitol also is added to 
reduce disulfide bonds. Anions of SDS bind to main chains at a ratio of about 
one SDS anion for every two amino acid residues. This complex of SDS with 
a denatured protein has a large net negative charge that is roughly propor- 
tional to the mass of the protein. The negative charge acquired on binding 
SDS is usually much greater than the charge on the native protein; this na¬ 
tive charge is thus rendered insignificant. The SDS-protein complexes are 
then subjected to electrophoresis. When the electrophoresis is complete, the 
proteins in the gel can be visualized by staining them with silver or a dye such 
as Coomassie blue, which reveals a series of bands (Figure 4.9). Radioactive 


FIGURE 4.9 Staining of proteins after 
electrophoresis. Proteins subjected to 
electrophoresis on an SDS-polyacrylamide 
gel can be visualized by staining with 
Coomassie blue. [Courtesy of Kodak Scientific 
Imaging Systems.] 









labeis can be detected by placing a sheet of x-ray film o ver the gel, a pro- 
cedure called autoradiography. 

Small proteins move rapidly through the gel, whereas large proteins stay at 
the top, near the point of application of the mixture. The mobility of most 
polypeptide chains under these condi tions is linearly proportional to the log- 
arithm of their mass (Figure 4.10). ( Some carbohydrate-rich protei ns and) 
( membrane proteins do not obey this empirical relation, however.j SDS- 
polyacrylamide gel electrophoresis (SDS-PAGE) is rapid, sensitive, and 
capable of a high degree of resolution. As little as 0.1 jag (^2 pmol)) of a 
protein gives a distinct band when stained with Coomassie blue, and even 
less (—0.02 jüLg) can be dete cted with a silver stain. Proteins that differ in 
mass by about 2% (e.g.,(40 and 41 kd,)arising from a difference of about(TÕ) 
residues) can usually be distinguished. 

We can examine the efficacy of our purification scheme by analyzing a 
part of each fraction by SDS-PAGE. The initial fractions will display dozens 
to hundreds of proteins. As the purification progresses, the number of bands 
will diminish, and the prominence of one of the bands should increase. This 
band will correspond to the protein of interest. 

Isoelectric Focusing. Proteins can also be separated electrophoreti- 
cally on the basis of their relative contents of acidic and basic residues. The 
isoelectric point (pl) of a protein is the pH at which its net charge is zero. 
At this pH, its electrophoretic mobility is zero because 2 : in equation 1 is 
equal to zero. For example, the pl of cytochrome c, a highly basic electron- 
transport protein, is 10.6, whereas that of serum albumin, an acidic protein 
in blood, is 4.8. Suppose that a mixture of proteins undergoes elec¬ 
trophoresis in a pH gradient in a gel in the absence of SDS. Each protein 
will move until it reaches a position in the gel at which the pH is equal to 
the pl of the protein. This method of separating proteins according to their 
isoelectric point is called isoelectric focusing. The pH gradient in the gel is 
formed first by subjecting a mixture of polyampholytes (small multicharged 
polymers) having many pl values to el ectrophoresis. Isoelectri c focusing can 
readily resolve protein s that differ in( pl by as little as 0.01, )which mea ns 
that proteins differing (by one net charge can be separated (Figure 4.11).) 



Two-Dimensional Electrophoresis. Isoelectric focusing can be com- 
bined with SDS-PAGE to obtain very high resolution separations. A sin¬ 
gle sample is first subjected to isoelectric focusing. This single-lane gel is 
then placed horizontally on top of an SDS-polyacrylamide slab. The pro¬ 
teins are thus spread across the top of the polyacrylamide gel according to 
how far they migrated during isoelectric focusing. They then undergo elec¬ 
trophoresis again in a perpendicular direction (vertically) to yield a two- 
dimensional pattern of spots. In such a gel, proteins have been separated in 
the horizontal direction on the basis of isoelectric point and in the vertical 
direction on the basis of mass. It is remarkable that more than a thousand 
different proteins in the bacterium Escherichia coli can be resolved in a sin¬ 
gle experiment by two-dimensional electrophoresis (Figure 4.12). 
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Relative mobility 


FIGURE 4.10 Electrophoresis can 
determine mass. The electrophoretic 
mobility of many proteins in 
SDS-polyacrylamide gels is inversely 
proportional to the logarithm of their 
mass. [After K. Weber and M. Osborn, 

The Proteins, vol. 1, 3d ed. (Acadennic Press, 
1975), p. 179.] 


FIGURE 4.11 The principie of 
isoelectric focusing. A pH gradient is 
established in a gel before loading the 
sample. (A) The sample is loaded and 
voltage is appiied. The proteins will 
migrate to their isoelectric pH, the location 
at which they have no net charge. (B) The 
proteins form bands that can be excised 
and used for further experimentation. 


























FIGURE 4.12 Two-dímensional gel 
electrophoresis. (A) A protein sample is 
initially fractionated in one dimension by 
isoelectric focusing as described in Figure 
4.11. The isoelectric focusing gel is then 
attached to an SDS-polyacrylamide gel, 
and electrophoresis is performed in the 
second dimension, perpendicular to the 
original separation. Proteins with the same 
pl are now separated on the basis of mass. 
(B) Proteins from £ coli were separated 
by two-dimensional gel electrophoresis, 
resolving more than a thousand different 
proteins. The proteins were first separated 
according to their isoelectric pH in the 
horizontal direction and then by their 
apparent mass in the vertical direction. 

[(B) Courtesy of Dr. Patrick H. 0'Farrell.] 


Proteins isolated from cells under (different physiological conditions)can 
be subjected to two-dimensional ele ctrophoresis, folio wed by an (examina-) 
(tion^of the intensity of the signal^ In this way, particular proteins can be 
seen to increase or decrease in concentration in response to the physiologi¬ 
cal State. How can we tell what protein is being regulated? A former draw- 
back to the power of the two-dimensional gel is that, although many pro¬ 
teins are displayed, they are not identified. It is now possible to identify 
proteins by coupling two-dimensional gel electrophoresis with mass spec- 
trometric techniques. We will consider these techniques when we examine 
how the mass of a protein is determined (Section 4.1.6). 

4.1.5 A Protein Purification Scheme Can 
Be Quantitatively Evaluated 

To determine the success of a protein purification scheme, we monitor the 
procedure at each step by determining specific activity and by performing 
an SDS-PAGE analysis. Consider the results for the purification of a ficti- 
tious protein, summarized in Table 4.1 and Figure 4.13. At each step, the 
following parameters are measured: 

Total protein. The quantity of protein present in a fraction is obtained by 
determining the protein concentration of a part of each fraction and multi- 
plying by the fraction^s total volume. 

Total activity. The enzyme activity for the fraction is obtained by measur- 
ing the enzyme activity in the volume of fraction used in the assay and mul- 
tiplying by the fraction s total volume. 


r 


TABLE 4.1 Quantification of a purification protocol for a fictitious protein 


Step 

Total protein 
(mg) 

Total activity 
(units) 

Specific activity, 
(units mg~^) 

Yield 

(%) 

Purification 

Icvcl 

Homogenization 

15,000 

150,000 

10 

100 

1 

Salt fractionation 

4,600 

138,000 

30 

92 

3 

lon-exchange chromatography 

1,278 

115,500 

90 

77 

9 

Molecular(exclusion) 

(chromatography) 

(6Õ) 

(75,000) 

(1,100) 


(TTo) 

Affinity chromatography 

1.75 

52,500 

30,000 

35 

3,000 

























Homogenate Sait lon exchange Molecular Affinity 

fractionation chromatography exclusion chromatography 
chromatography 
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FIGURE 4.13 Electrophoretic analysis 
of a protein purification. The 

purification scheme in Table 4.1 was 
analyzed by SDS-PAGE. Each lane contained 
50 jLtg of sample. The effectiveness of the 
purification can be seen as the band for 
the protein of interest becomes more 
prominent relative to other bands. 


Specific activity. This parameter is obtained by dividing total activity by 
total protein. 

Yield. This parameter is a measure of the activity retained after each 
purification step as a percentage of the activity in the crude extract. The 
amount of activity in the initial extract is taken to be 100%. 

Purification levei This parameter is a measure of the increase in purity and 
is obtained by dividing the specific activity, calculated after each purifica¬ 
tion step, by the specific activity of the initial extract. 

As we see in Table 4.1, the first purification step, salt fractionation, leads 
to an increase in purity of only 3-fold, but we reco ver nearly all the target 
protein in the original extract, given that the yield is 92%. After dialysis to 
lower the high concentration of salt remaining from the salt fractionation, 
the fraction is passed through an ion-exchange column. The purification 
now increases to 9-fold compared with the original extract, whereas the 
yield falis to 77%. Molecular exclusion chromatography brings the levei of 
purification to 100-fold, but the yield is now at 50%. The final step is affin¬ 
ity chromatography with the use of a ligand specific for the target enzyme. 
This step, the most powerful of these purification procedures, results in a 
purification levei of 3000-fold, while lowering the yield to 35%. The SDS- 
PAGE in Figure 4.13 shows that, if we load a constant amount of protein 
onto each lane after each step, the number of bands decreases in proportion 
to the levei of purification, and the amount of protein of interest increases 

as a proportion of the t otal protein present. __ 

A good purification ^scheme takes into account both^urification leveis) 
and yield. A high degree of purification and a poor yield leave little protein 
with which to experiment. A high yield with low purification leaves many 
contaminants (proteins other than the one of interest) in the fraction and 
complicates the interpretation of experiments. 

4.1.6 Ultracentrifugation Is Valuable for Separating 
Biomolecules and Determining Their Masses 

We have already seen that centrifugation is a powerful and generally appli- 
cable method for separating a crude mixture of cell components, but it is 
also useful for separating and analyzing biomolecules themselves. With this 



































































Density (g cm 


-1 I * 

CHAPTER 4 • Exploring Proteins TABLE 4.2 S values and molecular weights of sample proteins 


Protein 

S value 

(Svedberg units) 

Molecular 

weight 

Pancreatic trypsin inhibitor 

1 

6,520 

Cytochrome c 

1.83 

12,310 

Ribonuclease A 

1.78 

13,690 

Myoglobin 

1.97 

17,800 

Trypsin 

2.5 

23,200 

Carbonic anhydrase 

3.23 

28,800 

Goncanavlin A 

3.8 

51,260 

Malate dehydrogenase 

5.76 

74,900 

Lactate dehydrogenase 

7.54 

146,200 


From T. Creighton, Proteins, 2nd Edition (W. H. Freeman and Company, 1993), 
Table 7.1. 


technique, we can determine such parameters as mass and density, learn 
something about the shape of a molecule, and investigate the interactions 
between molecules. To deduce these properties from the centrifugation data, 
we need a mathematical description of how a particle behaves in a centrifu¬ 
gai force. 

A particle will move through a liquid médium when subjected to a cen¬ 
trifugai force. A convenient means of quantifying the rate of movement is 
to calculate the sedimentation coefficient, s, of a particle by using the fol¬ 
io wing equation: 

s — m{l — vp)/f 

where m is the mass of the particle, v is the partial specific volume (the re¬ 
ciprocai of the particle density), p is the density of the médium and /is the 
frictional coefficient (a measure of the shape of the particle). The (1 — vp) 
term is the buoyant force exerted by liquid médium. 

Sedimentation coefficients are usually expressed in {Svedberg units (S)^ 
equal to s. The smaller the S value, the slower a molecule moves in a 

centrifugai field. The S values for a number of biomolecules and cellular 
components are listed in Table 4.2 and Figure 4.14. 

Several important conclusions can be drawn from the preceding equation: 

1. The sedimentation velocity of a particle depends in 
part on its mass. A more massive particle sediments more 
rapidly than does a less massive particle of the same shape 
and density. 

2. Shape, too, influences the sedimentation velocity be- 
cause it affects the viscous drag. The frictional coefficient 
fofa compact particle is smaller than that of an extended 
particle of the same mass. Hence, elongated particles sed- 
iment more slowly than do spherical ones of the same 
mass. 

3. A dense particle moves more rapidly than does a less 
dense one because the opposing buoyant force (1 — vp) 
is smaller for the denser particle. 

4. The sedimentation velocity also depends on the den¬ 
sity of the solution. (p). Particles sink when vp < 1, float 
when vp > 1, and do not move when Fp = 1. 
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FIGURE 4.14 Density and sedimentation coefficients of 
cellular components. [After L J. Kleinsmith and V. M. Kish, Principies 
of Cell and Molecular Biology, 2d ed. (Harper Coilins, 1995), p. 138.] 
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A technique called zonal, band, or most commonly gradient centrifuga- 
tion can be used to separate proteins with different sedimentation coeffi- 
cients. The first step is to form a density gradient in a centrifuge tube. Dif- 
fering proportions of a low-density solution (such as 5% sucrose) and a 
high-density solution (such as 20% sucrose) are mixed to create a linear gra¬ 
dient of sucrose concentration ranging (from^20% at the bottom of the tube) 
(to 5% at the to^ (Figure 4.15). The role of the gradient is to prevent con- 
nective flow. A small volume of a solution containing the mixture of pro¬ 
teins to be separated is placed on top of the density gradient. When the 
rotor is spun, proteins move through the gradient and separate according to 
their sedimentation coefficients. The time and speed of the centrifugation 
is determined empirically. The separated bands, or zones, of protein can be 
harvested by making a hole in the bottom of the tube and collecting drops. 
The drops can be measured for protein content and catalytic activity or an- 
other functional property. This sedimentation-velocity technique readily 
separates proteins differing in sedimentation coefficient by a factor of two 
or more. 

The mass of a protein can be directly determined by sedimentation equi- 
librium, in which a sample is centrifuged at relatively low speed so that sed¬ 
imentation is counterbalanced by diffusion. The sedimentation-equilibrium 
technique for determining mass is very accurate and can be applied under non- 
denaturing conditions in which the native quaternary structure of multimeric 
proteins is preserved. In contrast, SDS—polyacrylamide gel electrophoresis 
(Section 4.1.4) provides an estimate oí the mass of dissociated polypeptide 
(chains under denaturing condition^ Note that, if we know the mass of the 
dissociated components of a multimeric protein as determined by SDS- 
polyacrylamide analysis and the mass of the intact multimeric protein as de¬ 
termined by sedimentation equilibrium analysis, we can determine how 
many copies of each polypeptide chain is present in the multimeric protein. 


FIGURE 4.15 Zonal centrifugation. The 

steps are as follows: (A) form a density 
gradient, (B) layer the sample on top of 
the gradient, (C) place the tube in a 
swinging-bucket rotor and centrifuge it, 
and (D) collect the samples. [After D. 
Freifelder, Physical Biochemistry, 2d ed. 

(W. H. Freeman and Company, 1982), p. 397.] 


4.1.7 The Mass of a Protein Can Be Precisely 
Determined by Mass Spectrometry 

Mass spectrometry has been an established analytical technique in organic 
chemistry for many years. Until recently, however, the very low volatility 
of proteins made mass spectrometry useless for the investigation of these 
molecules. This difficulty has been circumvented by the introduction of 
techniques for effectively dispersing proteins and other macromolecules 
into the gas phase. These methods are called matrix-assisted laser desorp- 
tion-ionization (MALDI) and electrospray spectrometry. We will focus on 
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FIGURE 4.16 MALDI-TOF mass 
spectrometry. (1) The protein sample, 
embedded in an appropriate matrix, is 
ionized by the appiication of a laser beam. 
(2) An electrical field accelerates the ions 
formed through the flight tube toward the 
detector. (3) The lightest ions arrive first. 
(4) The ionizing laser pulse aiso triggers a 
clock that measures the time of flight 
(TOF) for the ions. [After J. T. Watson, 
Introduction to Mass Spectrometry, 3d ed. 
(Lippincott-Raven, 1997), p. 279.] 


Beam splitter 



MALDI spectrometry. In this technique, protein ions are generated and 
then accelerated through an electrical field (Figure 4.16). They travei 
through the flight tube, with the smallest traveling fastest and arriving at 
the detector first. Thus, the time of flight (TOF) in the electrical field is a 
measure of the mass (or, more precisely, the mass/charge ratio). Tiny 
amounts of biomolecules, as small as a few picomoles (pmol) to femtomoles 
(fmol), can be analyzed in this manner. A MALDI-TOF mass spectrum 
for a mixture of the proteins insulin and j8-lactoglobulin is shown in Figure 
4.17. The masses determined by MALDI-TOF are 5733.9 and 18,364, re- 
spectively, compared with calculated values of 5733.5 and 18,388. MALDI- 
TOF is indeed an accurate means of determining protein mass. 


Insulin 

(I + H)+ = 5733.9 


FIGURE 4.17 MALDI-TOF mass 
spectrum of insulin and j3- 
lactoglobulin. A mixture of 5 pmol 
each of insulin (I) and /3-lactoglobulin (L) 
was ionized by MALDI, which produces 
predominately singly charged molecular 
ions from peptides and proteins (I + 
for insulin and L + for lactoglobulin). 
However, molecules with multiple charges 
as well as small quantities of a singly 
charged dimer of insulin, (2 I + H)^, aIso 
are produced. [After J. T. Watson, Introduction 
to Mass Spectrometry, 3d ed. (Lippincott-Raven, 
1997), p. 282.] 
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Mass spectrometry has permitted the development of peptide mass fin- I 91 | 

gerprinting. This technique for identifying peptides has greatly enhanced the Amino Acid Sequence Determination 

utility of two-dimensional gels. Two-dimensional electrophoresis is per- 
formed as described in Section 4.1.4. The sample of interest is extracted and 
cleaved specifically by chemical or enzymatic means. The masses of the pro- 
tein fragments are then determined with the use of mass spectrometry. Fi- 
nally, the peptide masses, or fingerprint, are matched against the fingerprint 
found in databases of proteins that have been ‘‘electronically cleaved’' by a 
Computer simulating the same fragmentation technique used for the exper¬ 
imental sample. This technique has provided some outstanding results. For 
example, of 150 yeast proteins analyzed with the use of two-dimensional 
gels, peptide mass fingerprinting unambiguously identified 80%. Mass 
spectrometry has provided name tags for many of the proteins in two- 
dimensional gels. 


4.2 AMINO ACID SEQUENCES CAN BE DETERMINED 
BY AUTOMATED EDMAN DEGRADATION 


The protein of interest having been purified and its mass determined, the 
next analysis usually performed is to determine the protein s amino acid se¬ 
quence, or primary structure. As stated previously (Section 3.2.1), a wealth 
of information about a protein s function and evolutionary history can of- 
ten be obtained from the primary structure. Let us examine first how we 
can sequence a simple peptide, such as 

Ala - Gly - A sp - Phe - Ar g - Gly 

The first step is to determine the amino acid composition of the peptide. 
The peptide is hydrolyzed into i ts constituent amino acids by heating it in 
6 N HCl at 110°C for 24 hours.( Amino acids in hydrolysates )can be sepa- 
rated by (ion-exchange chromatography on columns of sulfonated poly-) 
(styrene.) The identity of the amino acid is revealed by its elution volume, 
which is the volume of buffer used to remove the amino acid from the col- 
umn (Figur e 4.18), and quantified by reaction wit h ninhydrin. Amino acids 
treated with( n inhydrin give an intense blue color,) except for proline, which 
gives a (yellow) color because it contains a secondary amino group. The con- 
centration of an amino acid in a solution, after heating with ninhydrin, is 
proportional to the optical absorbance of the solution. This technique can 
detect a microgram (10 nmol) of an amino acid, which is about the amount 
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FIGURE 4.18 Determination of amino 
acid composition. Different amino acids 
in a peptide hydrolysate can be separated 
by ion-exchange chromatography on a 
sulfonated polystyrene resin (such as 
Dowex-50). Buffers (in this case, sodium 
citrate) of increasing pH are used to elute 
the amino acids from the column. The 
amount of each amino acid present is 
determined from the absorbance. 
Aspartate, which has an acidic side chain, 
is first to emerge, whereas arginine, which 
has a basic side chain, is the last. The 
original peptide is revealed to be 
composed of one aspartate, one alanine, 
one phenylalanine, one arginine, and two 
glycine residues. 
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FIGURE 4.19 Fluorescent derívatives 
of amíno acids. Fluorescamine reacts 
with the ü!-annino group of an amino add 
to form a fluorescent derivative. 





Amine derivative 


present in a thumbprint. As little as a nanogram (10 pmol) of an amino acid 
can be detected by replacing ninhydrin with fluorescamine, which reacts with 
the n-amino group to form a highly fluorescent product (Figure 4.19). A 
comparison of the chromatographic patterns of our sample hydrolysate with 
that of a standard mixture of amino acids would show that the amino acid 
composition of the peptide is 

(Ala, Arg, Asp, Gly 2 , Phe) 

The parentheses denote that this is the amino acid composition of the pep¬ 
tide, not its sequence. 

The next step is often to identify the N-terminal amino acid by labeling 
it with a compound that forms a stable covalent bond. Fluorodinitrobenzene 
(FDNB) was first used for this purpose by Frederick Sanger. Dabsyl chlo- 
ride is now commonly used because it forms fluorescent derivatives that can 
be detected with high sensitivity. It reacts with an uncharged n-NH 2 group 
to form a sulfonamide derivative that is stable under conditions that hy- 
drolyze peptide bonds (Figure 4.20). Hydrolysis of our sample dabsyl- 
peptide in 6 N HCl would yield a dabsyl-amino acid, which could be iden- 
tified as dabsyl-alanine by its chromatographic properties. Dansyl chloride, 
too, is a valuable labeling reagent because it forms fluorescent sulfonamides. 




Dabsyl chloride 


H3C. /CH3 



Although the dabsyl method for determining the amino-terminal 
residue is sensitive and powerful, it cannot be used repeatedly on the same 
peptide, because the peptide is totally degraded in the acid-hydrolysis step 
and thus all sequence information is lost. Pehr Edman devised a method for 
labeling the amino-terminal residue and cleaving it from the peptide with- 
out disrupting the peptide bonds between the other amino acid residues. 
The Edman degradation sequentially removes one residue at a time from the 
amino end of a peptide (Figure 4.21). Phenyl isothiocy anate reacts with the 
uncharged terminal amino group of the peptide to form a phenylthiocar- 
bamoyl derivative. Then, under mildly acidic conditions, a cyclic derivative 
of the terminal amino acid is liberated, which leaves an intact peptide short- 
ened by one amino acid. The cyclic compound is a phenylthiohydantoin 
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Gly — Asp — Phe — Arg — Gly 


Amino Add Sequence Determination 


Dabsyl chioride 


Labeling 



Dabsyl alanine 


FIGURE 4.20 Determination of the amino-terminal residue of a peptide. Dabsyl 
chioride labeis the peptide, which is then hydrolyzed with the use of hydrochioric acid. The 
dabsyl-amino acid (dabsyl-alanine in this example) is identified by its chromatographic 
characteristics. 
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FIGURE 4.21 The Edman degradation. The labeled amino-terminal residue (PTH-alanine 
in the first round) can be released without hydrolyzing the rest of the peptide. Hence, the 
amino-terminal residue of the shortened peptide (Gly-Asp-Phe-Arg-Gly) can be determined 
in the second round. Three more rounds of the Edman degradation reveal the complete 
sequence of the original peptide. 































FIGURE 4.22 Separation of PTH-amino 
acids. PTH-amino adds can be rapidiy 
separated by high-pressure liquid 
chromatography (HPLC). In this HPLC 
profile, a mixture of PTH-amino acids is 
clearly resolved into its components. An 
unknown amino acid can be identified 
by its elution position relative to the 
known ones. 


(PTH)-amino acid, which can be identified by chromatographic proce- 
dures. The Edman procedure can then be repeated on the shortened pep- 
tide, yielding another PTH-amino acid, which can again be identified by 
chromatography. Three more rounds of the Edman degradation will reveal 
the complete sequence of the original peptide pentapeptide. 

The development of automated sequencers has markedly decreased the 
time required to determine protein sequences. One cycle of the Edman 
degradation—the cleavage of an amino acid from a peptide and its identi- 
fication—is carried out in les s than 1 hou r. By repeated degradations, the 
amino acid sequence of some (^ residues) in a protein can be determined. 
High-pressure liquid chromatography provides a sensitive means of distin- 
guishing the various amino acids (Figure 4.22). Gas-phase sequenators can 
analyze picomole quantities of peptides and proteins. This high sensitivity 
makes it feasible to analyze the sequence of a protein sample eluted from a 
single band of an SDS-polyacrylamide gel. 

4.2.1 Proteins Can Be Specifically Cleaved into Small Peptides 
to Facilitate Analysis 

In principie, it should be possible to sequence an entire protein by using the 
Edman method. In practice, the peptides cannot be much longer than about 
50 residues. This is so because the reactions of the Edman method, espe- 
cially the release step, are not 100% efficient, and so not all peptides in the 
reaction mixture release the amino acid derivative at each step. For instance, 
if the efficiency of release for each round were 98%, the proportion of ‘‘cor- 
rect'' amino acid released after 60 rounds would be (0.98^^), or 0.3—a hope- 
lessly impure mix. This obstacle can be circumvented by cleaving the orig¬ 
inal protein at specific amino acids into smaller peptides that can be 
sequenced. In essence, the strategy is to divide and conquer. 

Specific cleavage can be achieved by chemical or enzymatic methods. 
For example, cyanogen bromide (CNBr) splits polypeptide chains only on 
the carboxyl side of methionine residues (Figure 4.23). 


FIGURE 4.23 Cleavage 
by cyanogen bromide. 

Cyanogen bromide cleaves 
polypeptides on the carboxyl 
side of methionine residues. 



A protein that has 10 methionine residues will usually yield 11 peptides on 
cleavage with CNBr. Highly specific cleavage is also obtained with trypsin, 
a proteolytic enzyme from pancreatic juice. Trypsin cleaves polypeptide 
chains on the carboxyl side of arginine and lysine residues (Figure 4.24 and 
Section 9.1.4). A protein that contains 9 lysine and 7 arginine residues will 
usually yield 17 peptides on digestion with trypsin. Each of these tryptic 


FIGURE 4.24 Cleavage by 
trypsin. Trypsin hydrolyzes 
polypeptides on the carboxyl 
side of arginine and lysine 
residues. 
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TABLE 4.3 Specific cleavage of polypeptides 


Reagent 

Cleavage site 

Chemical cleavage 

Cyanogen bromide 

Carboxyl side of methionine residues 

0- lodosobenzoate 

Carboxyl side of tryptophan residues 

Hydroxylamine 

Asparagine-glycine bonds 

2 - Nitro - 5 - thiocyanobenzoate 

Amino side of cysteine residues 

Enzymatic cleavage 

(Trypsin) 

(Carboxyl side of lysine and arginine) 

(residues) 

Glostripain 

Carboxyl side of arginine residues 

Staphylococcal protease 

Carboxyl side of aspartate and 
glutamate residues (glutamate only 
under certain conditions) 

Thrombin 

Carboxyl side of arginine 

(Chymotrypsin) 

(Carboxyl side of tyrosine, tryptophan,) 

Tphenylalanine, leucine, and) 
(methionine) 

Carboxypeptidase A 

Amino side of C-terminal amino acid 
(not arginine, lysine, or proline) 
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peptides, except for the carboxyl-terminal peptide of the protein, will end 
with either arginine or lysine. Table 4.3 gives several other ways of specif- 
ically cleaving polypeptide chains. 

The peptides obtained by specific chemical or enzymatic cleavage are 
separated by some type of chromatography. The sequence of each purified 
peptide is then determined by the Edman method. At this point, the amino 
acid sequences of segments of the protein are known, but the order of these 
segments is not yet defined. How can we order the peptides to obtain the 
primary structure of the original protein? The necessary additional infor- 
mation is obtained from overlap peptides (Figure 4.25). A second enzyme is 
used to split the polypeptide chain at different linkages. For example, chy- 
motrypsin cleaves preferentially on the carboxyl side of aromatic and some 
other bulky nonpolar residues (Section 9.1.3). Because these chymotryptic 
peptides overlap two or more tryptic peptides, they can be used to establish 
the order of the peptides. The entire amino acid sequence of the polypep¬ 
tide chain is then known. 


Tryptic peptides Chymotryptic peptide 

Ala — Ala — Trp — Gly — Lys Vai — Lys — Ala — Ala — Trp 

Thr —Phe—Vai —Lys 

_ Tryptic peptide _ Tryptic peptide _ 

Thr — Phe—Vai — Lys — Ala — Ala — Trp — Gly — Lys 
Chymotryptic overlap peptide 


FIGURE 4.25 Overlap peptides. 

The peptide obtained by chymotrypic 
digestion overlaps two tryptic peptides, 
establishing their order. 


Additional steps are necessary if the initial protein sample is actually sev¬ 
eral polypeptide chains. SDS-gel electrophoresis under reducing conditions 
should display the number of chains. Alternatively, the number of distinct 
N-terminal amino acids could be determined. For a protein made up of two 
or more polypeptide chains held together by noncovalent bonds, denatur- 
ing agents, such as urea or guanidine hydrochloride, are used to dissociate 
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FIGURE 4.26 Dísuifíde-bond reduction. 

Polypeptides linked by disulfide bonds 
can be separated by reduction with 
dithiothreitol followed by alkylation to 
prevent reformation. 


the chains from one another. The dissociated chains must be separated from 
one another before sequence determination of the individual chains can be- 
gin. Polypeptide chains linked by disulfide bonds are separated by reduc¬ 
tion with thiols such as |3-mercaptoethanol or dithiothreitol. To prevent the 
cysteine residues from recombining, they are then alkylated with iodoac- 
etate to form stable S-carboxymethyl derivatives (Figure 4.26). Sequencing 
can then be performed as heretofore described. 

To complete our understanding of the protein’s structure, we need to de¬ 
termine the positions of the original disulfide bonds. This information can 
be obtained by using a diagonal electrophoresis technique to isolate the pep- 
tide sequences containing such bonds (Figure 4.27). First, the protein is 
specifically cleaved into peptides under conditions in which the disulfides 
remain intact. The mixture of peptides is applied to a corner of a sheet of 
paper and subjected to electrophoresis in a single lane along one side. The 
resulting sheet is exposed to vapors of performic acid, which cleaves disul¬ 
fides and converts them into cysteic acid residues. Peptides originally linked 
by disulfides are now independent and more acidic because of the forma- 
tion of an S 03 ~ group. 




SOs-^-OjS 



NH 


Cysteic acid 


This mixture is subjected to electrophoresis in the perpendicular direction 
under the same conditions as those of the first electrophoresis. Peptides that 
were devoid of disulfides will have the same mobility as before, and conse- 
quently all will be located on a single diagonal line. In contrast, the newly 
formed peptides containing cysteic acid will usually migrate differently from 
their parent disulfide-linked peptides and hence will lie off the diagonal. 
These peptides can then be isolated and sequenced, and the location of the 
disulfide bond can be established. 
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FIGURE 4.27 Diagonal electrophoresis. 

Peptides joined together by disulfide 
bonds can be detected by diagonal 
electrophoresis. The mixture of peptides 
is subjected to electrophoresis in a single 
lane in one direction (horizontal) and 
then treated with performic acid, which 
cleaves and oxidizes the disulfide 
bonds. The sample is then subjected to 
electrophoresis in the perpendicular 
direction (vertical). 


4.2.2 Amino Acid Sequences Are Sources 
of Many Kinds of Insight 

A proteínas amino acid sequence, once determined, is a valuable source of 
insight into the protein s function, structure, and history. 

1. The sequence of a protein of interest can be compared with all other known 
sequences to ascertain whether significant similarities exist. Does this protein 
belong to one of the established families? A search for kinship between a newly 
sequenced protein and the thousands of previously sequenced ones takes 











only a few seconds on a personal Computer (Section 7.2). If the newly iso- 
lated protein is a member of one of the established classes of protein, we can 
begin to infer information about the protein s function. For instance, chy- 
motrypsin and trypsin are members of the serine protease family, a clan of 
proteolytic enzymes that have a common catalytic mechanism based on a 
reactive serine residue (Section 9.1.4). If the sequence of the newly isolated 
protein shows sequence similarity with trypsin or chymotrypsin, the result 
suggests that it may be a serine protease. 

2. Comparison of sequences of the same protein in different species yields a 
wealth of information about evolutionary pathways. Genealogical relations 
between species can be inferred from sequence differences between their 
proteins. We can even estimate the time at which two evolutionary lines di- 
verged, thanks to the clocklike nature of random mutations. For example, a 
comparison of serum albumins found in primates indicates that human be- 
ings and African apes diverged 5 million years ago, not 30 million years ago 
as was once thought. Sequence analyses have opened a new perspective on 
the fóssil record and the pathway of human evolution. 

3. Amino acid sequences can be searched for the presence of internai repeats. 
Such internai repeats can reveal information about the history of an indi¬ 
vidual protein itself. Many proteins apparently have arisen by duplication 
of a primordial gene followed by its diversification. For example, calmod- 
ulin, a ubiquitous calcium sensor in eukaryotes, contains four similar 
calcium-binding modules that arose by gene duplication (Figure 4.28). 

4. Many proteins contain amino acid sequences that serve as signals desig- 
nating their destinations or controlling their processing. A protein destined for 
export from a cell or for location in a membrane, for example, contains a 
signal sequence, a stretch of about 20 hydrophobic residues near the amino 
terminus that directs the protein to the appropriate membrane. Another 
protein may contain a stretch of amino acids that functions as a nuclear lo- 
calization signal, directing the protein to the nucleus. 

5. Sequence data provide a basis for preparing antibodies specific for a pro¬ 
tein of interest. Careful examination of the amino acid sequence of a protein 
can reveal which sequences will be most likely to elicit an antibody when 
injected into a mouse or rabbit. Peptides with these sequences can be syn- 
thesized and used to generate antibodies to the protein. These specific an¬ 
tibodies can be very useful in determining the amount of a protein present 
in solution or in the blood, ascertaining its distribution within a cell, or 
cloning its gene (Section 4.3.3). 

6. Amino acid sequences are valuable for making DNA probes that are spe- 
cificfor the genes encoding the corresponding proteins (Section 6.1.4). Knowl- 
edge of a protein s primary structure permits the use of reverse genetics. 
DNA probes that correspond to a part of the amino acid sequence can be 
constructed on the basis of the genetic code. These probes can be used to 
isolate the gene of the protein so that the entire sequence of the protein can 
be determined. The gene in turn can provide valuable information about 
the physiological regulation of the protein. Protein sequencing is an inte¬ 
gral part of molecular genetics, just as DNA cloning is central to the analy- 
sis of protein structure and function. 

4.2.3 Recombinant DNA Technology Has Revolutionized 
Protein Sequencing 

Hundreds of proteins have been sequenced by Edman degradation of pep¬ 
tides derived from specific cleavages. Nevertheless, heroic effort is required 
to elucidate the sequence of large proteins, those with more than 1000 
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in a protein chain. Calmodulin, a 
calcium sensor, contains four similar units 
in a single polypeptide chain shown in 
red, yellow, blue, and orange. Each unit 
binds a calcium ion (shown in green). 




I 98 I residues. For sequencing such proteins, a complementary experimental ap- 

CHAPTER 4 • Exploring Proteins proach based on recombinant DNA technology is often more efficient. As 

will be discussed in Chapter 6, long stretches of DNA can be cloned and 
sequenced, and the nucleotide sequence directly reveals the amino acid se- 
quence of the protein encoded by the gene (Figure 4.29). Recombinant 
DNA technology is producing a wealth of amino acid sequence information 
at a remarkable rate. 


DNA sequence gGG I TTC I TTG I GGA I GCA | GCA | GGA | AGC | ACT | ATG | GGC | GCA | 


Amino acid 
sequence 


Gly Phe Leu Gly Ala Ala Gly Ser Thr Met Gly Ala 

FIGURE 4.29 DNA sequence yields the amino acid sequence. The complete nucleotide 
sequence of HIV-1 (human immunodeficiency virus), the cause of AIDS (acquired immune 
deficiency syndrome), was determined within a year after the isolation of the virus. A part 
of the DNA sequence specified by the RNA genome of the virus is shown here with the 
corresponding amino acid sequence (deduced from a knowledge of the genetic code). 


Even with the use of the DNA base sequence to determine primary struc- 
ture, there is still a need to work with isolated proteins. The amino acid 
sequence deduced by reading the DNA sequence is that of the nascent pro¬ 
tein, the direct product of the translational machinery. Many proteins are 
modified after synthesis. Some have their ends trimmed, and others arise 
by cleavage of a larger initial polypeptide chain. Cysteine residues in some 
proteins are oxidized to form disulfide links, connecting either parts within 
a chain or separate polypeptide chains. Specific side chains of some proteins 
are altered. Amino acid sequences derived from DNA sequences are rich in 
information, but they do not disclose such posttranslational modifications. 
Chemical analyses of proteins in their final form are needed to delineate the 
nature of these changes, which are criticai for the biological activities of most 
proteins. Thus, genomic andproteomic analyses are complementary approaches 
to elucidating the structural basis of protein function. 


4.3 IIVIIVIUNOLOGY PROVIDES IMPORTANT TECHNIQUES 
WITH WHICH TO INVESTIGATE PROTEINS 

Immunological methods hav e become important tools used to purify a pro¬ 
tein, locate it in the cell, or (quantify) how much of the protein is present. 
These methods are predicated on the exquisite specificity of antibodies for 
their target proteins. Labeled antibodies provide a means to tag a specific 
protein so that it can be isolated, quantified, or (visualized.) 

4.3.1 Antibodies to Specific Proteins Can Be Generated 

Immunological techniques begin with the generation of antibodies to a par¬ 
ticular protein. An antibody (also called an immunoglobulin, Ig) is a protein 
synthesized by an animal in response to the presence of a foreign substance, 
called an antigen, and normally functions to protect the animal from infec- 
tion (Chapter 33). Antibodies have s p ec ific and high affinity for the anti- 
gens that elicited their synthe sis. í Proteins, polysaccharides, and nucleic) 
(acids can be effective antigens.) An antibody recognizes a specific group or 
cl uster of amino acids on a large molecule called an antigenic determinant, 
or [epitope) (Figures 4.30 and 4.31). Small foreign molecule s, such as syn- 
thetic peptides, also can elicit antibodies, provided that thej small molecule ; 
(contains a recognized epitope) and is attache d to a ma cromolecular carrier. 
The small foreign molecule itself is called a (hapten.) Animais have a very 
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large repertoire of antibody-producing cells, each producing an antibody of 
a single specificity. An antigen acts by stimulating the proliferation of the 
small number of cells that were already forming an antibody capable of rec- 
ognizing the antigen (Chapter 33). 

Immunological techniques depend on our being able to generate anti- 
bodies to a specific antigen. To obtain antibodies that recognize a particu¬ 
lar protein, a biochemist injects the protein into a rabbit twice, 3 weeks apart. 




FIGURE 4.30 Antibody structure. 

(A) IgG antibodies consist of four chains, 
two heavy chains (blue) and two light 
chains (red), linked by disulfide bonds. 

The heavy and light chains come together 
to form Fab domains, which have the 
antigen-binding sites at the ends. The two 
heavy chains form the Fc domain. The Fab 
domains are linked to the Fc domain by 
flexible linkers. (B) A more schematic 
representation of an IgG molecule. 





FIGURE 4.31 Antigen-antibody 
ínteractions. A protein antigen, in this 
case lysozyme, binds to the end of an 
Fab domain from an antibody. The end 
of the antibody and the antigen have 
complementary shapes, allowing a large 
amount of surface to be buried on binding. 









Polyclonal Antibodies 



Monoclonal Antibodies 



FIGURE 4.32 Polyclonal and 
monoclonal antibodies. Most antigens 
have several epitopes. Polyclonal 
antibodies are heterogeneous mixtures of 
antibodies, each specific for one of the 
various epitopes on an antigen. 
Monoclonal antibodies are all identical, 
produced by clones of a single antibody- 
producing cell. They recognize one specific 
epitope. [After R. A. Goldsby, T. J. Kindt, B. A. 
Osborne, Kuby Immunology, 4th ed. (W. H. 
Freeman and Connpany, 2000), p. 154.] 


The injected protein stimulates the reproduction of cells producing anti¬ 
bodies that recognize the foreign substance. Blood is drawn from the im- 
munized rabbit several weeks later and centrifuged to separate blood cells 
from the supernatant, or serum. The serum, called an antiserum, contains 
antibodies to all antigens to which the rabbit has been exposed. Only some 
of them will be antibodies to the injected protein. Moreover, antibodies of 
a given specificity are not a single molecular species. For instance, 2,4- 
dinitrophenol (DNP) has been used as a hapten to generate antibodies to 
DNP. Analyses of anti-DNP antibodies revealed a wide range of binding 
affinities—the dissociation constants ranged from about 0.1 nM to 1 jaM. 
Correspondingly, a large number of bands were evident when anti-DNP 
antibody was subje cted to isoelectric focusing. These results indicate that 
cells are producing^ many different antibodies, ^ach recognizing a differenf) 
I surface feature of the same antigen.)The antibodies are heterogeneous, or 
(polyclonal) (Figme 4.32). This heterogeneity is a barrier, which can compli- 
cate the use of these antibodies. 


4.3.2 Monoclonal Antibodies with Virtually Any Desired 
Specificity Can Be Readily Prepared 

The discovery of a means of producing monoclonal antibodies of virtually 
any desired specificity was a major breakthrough that intensified the power 
of immunological approaches. Just as working with impure proteins makes 
it difficult to interpret data and understand function, so too does working 
with an impure mixture of antibodies. The ideal would be to isolate a clone 
of cells that produce only a single antibody. The problem is that antibody- 
producing cells isolated from an organism die in a short time. 

Immortal cell lines that produce monocl onal antibodies do exist. These 
cell lines are derived from a type of câncer, {multiple myeloma^ a. malignant 
disorder of antibody-producing cells. In this câncer, a single transformed 
plasma cell divides uncontrollably, generating a very large number of cells 
of a single kind. They are a clone because they are descended from the same 
cell and have identical properties. The identical cells of the myeloma secrete 
large amounts of normal immunoglobulin of a single kind generation after 
generation. A myeloma can be transplanted from one mouse to another, 
where it continues to proliferate. These antibodies were useful for elucidat- 
ing antibody structure, but nothing is known about their specificity and so 
they are useless for the immunological methods described in the next pages. 

Cesar Milstein and Georges Kòhler discovered that large amounts of 
homogeneous antibody of nearly any desired specificity could be obtained by 
fusing a short-lived antibody-producing cell with an immortal myeloma cell. 
An antigen is inject ed into a mouse, and its spleen is removed several weeks 
later (Figure 4.33).( A mixture of plasma cells from this spleen is fused in) 
ívitro with myeloma cells. Each of the resulting hybrid cells, cslledhybridoma) 
(cen5,) indefinitely produces homogeneous antibody specified by the parent 

cell from the spleen. Hybridoma cells can then be screened, by using some 
sort of assay for the antigen-antibody interaction, to determine which ones 
produce antibody having the desired specificity. Collections of cells shown 
to produce the desired antibody are subdivided and reassayed. This process 
is repeated until a pure cell line, a clone producing a single antibody, is iso¬ 
lated. These positive cells can be grown in culture médium or injected into 
mice to induce myelomas. Alternatively, the cells can be frozen and stored 
for long periods. 

The hybridoma method of producing monoclonal antibodies has opened 
new vistas in biology and medicine. Large amounts of homogeneous antibod- 
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FIGURE 4.33 Preparation of 
monoclonal antibodies. (Hybridomi) 
cells are formed by fusion of Cantibo^j 
( producin^cells) and (myeloma cells.) The 
hybrid cells are allowed to proliferate by 
growing them in selective médium. They 
are then screened to determine which 
ones produce antibody of the desired 
specificity. [After C. Milstein. Monoclonal 
antibodies. Copyright © 1980 by Scientific 
American, Inc. All rights reserved.] 


ies with tailor-made specificities can be readily prepared. They are sources of 
insight into relations between antibody structure and specificity Moreover, 
monoclonal antibodies can serve as precise analy tical and preparative reagents. 
For example, a pure antibody can be obtained against an antigen that has 
not yet been isolated (Section 4.4). Proteins that guide development have 
been identified with the use of monoclonal antibodies as tags (Figure 4.34). 
(Monoclonal antibodies attached to solid supports can be used as affinity) 

(columns to purify scarce proteins.) This method has been used to purify in- 
terferon (an antiviral protein) 5000-fold from a crude mixture. Clinicai lab- 
oratories are using monoclonal antibodies in many assays. For example, the 
detection in blood of isozymes that are normally localized in the heart points 
to a myocardial infarction (heart attack). Blood transfusions have been made 
safer by antibody screening of donor blood for viruses that cause AIDS (ac- 
quired immune deficiency syndrome), hepatitis, and other infectious dis- 
eases. Monoclonal antibodies are also being evaluated for use as therapeu- 
tic agents, as in the treatment of câncer. Furthermore, the vast repertoire of 
antibody specificity can be tapped to generate catalytic antibodies having 
novel features not found in naturally occurring enzymes. 

4.3.3 Proteins Can Be Detected and Quantitated by Using 
an Enzyme-Linked Immunosorbent Assay 

Antibodies can be used as exquisitely specific analytic reagents to quantify 
the amount of a protein or other antigen. The technique is the enzyme-linked 



FIGURE 4.34 Fluorescence mícrograph 
of a developing Drosophíta embryo. 

The embryo was stained with a 
fluorescent-labeled monoclonal antibody 
for the DNA-binding protein encoded by 
engrailed, an essential gene in specifying 
the body plan. [Courtesy of Dr. Nipam Patel 
and Dr. Corey Goodman.] 








































I 102 I immunosorbent assay (ELISA). In this method, an enzyme, which reacts with 

CHAPTER 4 • Exploring Proteins a colorless substrate to produce a colored product, is covalently linked to a 

specific antibody that recognizes a target antigen. If the antigen is present, 
the antibody-enzyme complex will bind to it, and the enzyme component 
of the antibody-enzyme complex will catalyze the reaction generating the 
colored product. Thus, the presence of the colored product indicates the 
presence of the antigen. Such an enzyme-linked immunosorbent assay, 
which is rapid and convenient, can detect less than a nanogram (10~^ g) of 
a protein. ELISA can be performed with either polyclonal or monoclonal 
antibodies, but the use of monoclonal antibodies yields more reliable results. 

7 We will consider two among the several types of ELISA. The indi- 
rect ELISA is used to detect the presence of antibody and is the basis 
of the test for HIV infection. In that test, virai core proteins (the antigen) 
are absorbed to the bottom of a well. Antibodies from a patient are then 
added to the coated well and allowed to bind to the antigen. Finally, en¬ 
zyme-linked antibodies to human antibodies (for instance, goat antibodies 
that recognize human antibodies) are allowed to react in the well and un- 
bound antibodies are removed by washing. Substrate is then applied. An 
enzyme reaction suggests that the enzyme-linked antibodies were bound to 
human antibodies, which in turn implies that the patient had antibodies to 
the virai antigen (Figure 4.35). 

The Sandwich ELISA allows both the detection and the quantitation of 
antigen. Antibody to a particular antigen is first absorbed to the bottom of 
a well. Next, the antigen (or blood or urine containing the antigen) is added 
to the well and binds to the antibody. Finally, a second, different antibody 
to the antigen is added. This antibody is enzyme linked and is processed as 
described for indirect ELISA. In this case, the extent of reaction is directly 
proportional to the amount of antigen present. Consequently, it permits the 
measurement of small quantities of antigen (see Figure 4.35). 


(A) Indirect ELISA 



Antigen- Specific antibody 

coated well binds to antigen 


(B) Sandwich ELISA 


Enzyme-linked 
antibody binds to 
specific antibody 


Substrate is added and 
converted by enzyme into 
colored product; the rate 
of color formation is 
proportional to the amount 
of specific antibody 



Wash 



Wash 



Wash 
-> 



Monoclonal 
antibody- 
coated well 


Antigen binds A second monoclonal Substrate is added and 

to antibody antibody, linked to converted by enzyme into 

enzyme, binds to colored product; the rate 

immobilized antigen of color formation is 

proportional to the 
amount of antigen 


FIGURE 4.35 Indirect ELISA and sandwích ELISA (A) In indirect ELISA, the production 
of color indicates the amount of an antibody to a specific antigen. (B) In sandwich ELISA, 
the production of color indicates the quantity of antigen. [After R. A. Goldsby, T. J. Kindt, B. A. 
Osborne, Kuby Immunology, 4th ed. (W. H. Freeman and Connpany, 2000), p. 162.] 



























































4.3.4 Western Blotting Permits the Detection of Proteins 
Separated by Gel Electrophoresis 

Often it is necessary to detect small quantities of a particular protein in the 
presence of many other proteins, such as a virai protein in the blood. Very 
small quantities of a protein of interest in a cell or in body fluid can be de- 
tected by an immunoassay technique called Western blotting (Figure 4.36). 
A sample is subjected to electrophoresis on an SDS-polyacrylamide gel. 
Blotting (or more typically electroblotting) transfers the resolved proteins 
on the gel to the surface of a polymer sheet to make them more accessible 
for reaction. An antibody that is specific for the protein of interest is added 
to the sheet and reacts with the antigen. The antibody-antigen complex on 
the sheet then can be detected by rinsing the sheet with a second antibody 
specific for the first (e.g., goat antibody that recognizes mouse antibody). 
A radioactive labei on the second antibody produces a dark band on x-ray 
film (an autoradiogram). Alternatively, an enzyme on the second antibody 
generates a colored product, as in the ELISA method. Western blotting 
makes it possible to find a protein in a complex mixture, the proverbial nee- 
dle in a haystack. It is the basis for the test for infection by hepatitis C, where 
it is used to detect a core protein of the virus. This technique is also very 
useful in the cloning of genes. 
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FIGURE 4.36 Western blotting. 

Proteins on an SDS-polyacrylamide gel 
are transferred to a polymer sheet and 
stained with radioactive antibody. A band 
corresponding to the protein to which 
the antibody binds appears in the 
autoradiogram. 
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4.3.5 Fluorescent Markers Make Possible the Visualization 
of Proteins in the Cell 

Biochemistry is often performed in test tubes or polyacrylamide gels. How- 
ever, most proteins function in the context of a cell. Fluorescent markers 
provide a powerful means of examining proteins in their biological context. 
For instance, cells can be stained with fluorescence-labeled antibodies or 
other fluorescent proteins and examined hy fluorescence microscopy to reveal 
the location of a protein of interest. Arrays of parallel bundles are evident 
in cells stained with antibody specific for actin, a protein that polymerizes 
into filaments (Figure 4.37). Actin filaments are constituents of the cy- 
toskeleton, the internai scaffolding of cells that Controls their shape and 
movement. By tracking protein location, fluorescent markers also provide 
clues to protein function. For instance, the glucocorticoid receptor protein 
is a transcription factor that Controls gene expression in response to the 
steroid hormone cortisone. The receptor was linked to green fluorescent 



FIGURE 4.37 Actin filaments. 

Fluorescence micrograph of actin filaments 
in a cell stained with an antibody specific 
to actin. [Courtesy of Dr. Elias Lazarides.] 























FIGURE 4.38 Nuclear localízation of 
a steroíd receptor. (A) The receptor, 
made visible by attachment of the 
green fluorescent protein, is located 
predominantiy in the cytoplasm of the 
cultured cell. (B) Subsequent to the 
addition of corticosterone (a glucocorticoid 
steroid), the receptor moves into the 
nucleus. [Courtesy of Professor William B. 
Pratt/Department of Pharmacology, University 
of Michigan.] 



(B) 






FIGURE 4.39 Immunoelectron 
mícroscopy. The opaque particles (150-Â, 
or 15-nm, diameter) in this electron 
micrograph are clusters of gold atoms 
bound to antibody molecules. These 
membrane vesicles from the synapses of 
neurons contain a channel protein that is 
recognized by the specific antibody. 
[Courtesy of Dr. Peter Sargent] 


protein (GPF), a naturally fluorescent protein isolated from the jellyfish 
Aequorea victoria (Section 3.6.5). Fluorescence microscopy revealed that, in 
the absence of the hormone, the receptor is located in the cytoplasm 
(Figure 4.38A). On addition of the steroid, the receptor is translocated to 
the nucleus, where it binds to DNA (Figure 4.38B). 

The highest resolution of fluorescence microscopy is about 0.2 jam 
(200 nm, or 2000 A), the wavelength of visible light. Finer spatial resolution 
can be achieved by electron microscopy by using antibodies tagged with 
electron-dense markers. For example, ferritin conjugated to an antibody 
can be readily visualized by electron microscopy because it contains an 
electron-dense core rich in iron. Clusters of gold also can be conjugated to 
antibodies to make them highly visible under the electron microscope. 
Immunoelectron microscopy can define the position of antigens to a resolu¬ 
tion of 10 nm (100 Â) or finer (Figure 4.39). 


4.4 PEPTIDES CAN BE SYNTHESIZED BY AUTOMATED 
SOLID-PHASE METHODS 

The ability to synthesize peptides of defined sequence is a powerful tech- 
nique for extending biochemical analysis for several reasons. 

1. Synthetic peptides can serve as antigens to stimulate the formation of spe- 
cific antibodies. For instance, as discussed earlier, it is often more efficient 
to obtain a protein sequence from a nucleic acid sequence than by se- 
quencing the protein itself (see also Chapter 6). Peptides can be synthesized 
on the basis of the nucleic acid sequence, and antibodies can be raised that 






target these peptides. These antibodies can then be used to isolate the in- 
tact protein from the cell. 

2. Synthetic peptides can be used to isolate receptors for many hormones and 
other signal molecules. For example, white blood cells are attracted to bac¬ 
téria by formylmethionyl (fMet) peptides released in the breakdown of 
bacterial proteins. Synthetic formylmethionyl peptides have been useful in 
identifying the cell-surface receptor for this class of peptide. Moreover, 
synthetic peptides can be attached to agarose beads to prepare affinity 
chromatography columns for the purification of receptor proteins that 
specifically recognize the peptides. 

T 3. Synthetic peptides can serve as drugs. Vasopressin is a peptide hor- 
mone that stimulates the reabsorption of water in the distai tubules 
of the kidney, leading to the formation of more concentrated urine. Patients 
with diabetes insipidus are deficient in vasopressin (also called antidiuretic 
hormone), and so they excrete large volumes of urine (more than 5 liters per 
day) and are continually thirsty. This defect can be treated by administer- 
ing 1 -desamino-8-D-arginine vasopressin, a synthetic analog of the missing 
hormone (Figure 4.40). This synthetic peptide is degraded in vivo much 
more slowly than vasopressin and, additionally, does not increase the blood 
pressure. 




4. Finally, studying synthetic peptides can help define the rules governing the 
three-dimensional structure of proteins. We can ask whether a particular se- 
quence by itself folds into an a helix, j8 strand, or hairpin turn or behaves 
as a random coil. 

How are these peptides constructed? The amino group of one amino acid 
is linked to the carboxyl group of another. However, a unique product is 
formed only if a single amino group and a single carboxyl group are avail- 
able for reaction. Therefore, it is necessary to block some groups and to 
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FIGURE 4.40 Vasopressin and synthetic 
vasopressin. Structural formulas of 

(A) vasopressin, a peptide hormone 
that stimulates water resorption, and 

(B) l-desamino-8-D-arginine vasopressin, 
a more stable synthetic analog of this 
antidiuretic hormone. 
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FIGURE 4.41 Amino acid activation. 

Dicyclohexylcarbodiimide is used to 
activate carboxyl groups for the 
formation of peptide bonds. 


activate others to prevent unwanted reactions. The a-amino group of the 
first amino acid of the desired peptide is blocked with a tert-butyloxycar- 
bonyl (t-Boc) group, yielding a t-Boc amino acid. The carboxyl group of this 
same amino acid is activated by reacting it with a reagent such as dicyclohexyl¬ 
carbodiimide (DCC), as illustrated in Figure 4.41. The free amino group of 
the next amino acid to be linked attacks the activated carboxyl, leading to 
the formation of a peptide bond and the release of dicyclohexylurea. The car¬ 
boxyl group of the resulting dipeptide is activated with DCC and reacted 
with the free amino group of the amino acid that will be the third residue 


f-Boc 


N 

H 



+ 



Protected amino acid n 


Reactive resin 



H2N 



Couple with 

”3^ protected amino acid n - 1 



I Subsequent deprotection 
I and coupling cycles 



FIGURE 4.42 Solid-phase peptide synthesis. The sequence of steps in solid-phase 
synthesis is: (1) anchoring of the C-terminal amino acid, (2) deprotection of the amino 
terminus, and (3) coupling of the next residue. Steps 2 and 3 are repeated for each added 
amino acid. Finally, in step 4, the completed peptide is released from the resin. 
























in the peptide. This process is repeated until the desired peptide is synthe- 
sized. Exposing the peptide to dilute acid removes the t-Boc protecting group 
from the first amino acid while leaving peptide bonds intact. 

Peptides containing more than 100 amino acids can be synthesized by 
sequential repetition of the preceding reactions. Linking the growing pep¬ 
tide chain to an insoluble matrix, such as polystyrene beads, further en- 
hances efficiency. A major advantage of this solid-phase method is that the 
desired product at each stage is bound to beads that can be rapidly filtered 
and washed, and so there is no need to purify intermediates. All reactions 
are carried out in a single vessel, eliminating losses caused by repeated trans- 
fers of Products. The carboxyl-terminal amino acid of the desired peptide 
sequence is first anchored to the polystyrene beads (Figure 4.42). The t-Boc 
protecting group of this amino acid is then removed. The next amino acid 
(in the protected t-Boc form) and dicyclohexylcarbodiimide, the coupling 
agent, are added together. After the peptide bond forms, excess reagents 
and dicyclohexylurea are washed away, leaving the desired dipeptide prod¬ 
uct attached to the beads. Additional amino acids are linked by the same 
sequence of reactions. At the end of the synthesis, the peptide is released 
from the beads by adding hydrofluoric acid (HF), which cleaves the car¬ 
boxyl ester anchor without disrupting peptide bonds. Protecting groups on 
potentially reactive side chains, such as that of lysine, also are removed at 
this time. This cycle of reactions can be readily automated, which makes it 
feasible to routinely synthesize peptides containing about 50 residues in 
good yield and purity. In fact, the solid-phase method has been used to syn¬ 
thesize interferons (155 residues) that have antiviral activity and ribonucle- 
ase (124 residues) that is catalytically active. 
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Determination of 
Three-Dimensional Structure 


4.5 THREE-DIIVIENSIONAL PROTEIN STRUCTURE 
CAN BE DETERIVIINED BY NMR SPECTROSCOPY 
AND X-RAY CRYSTALLOGRAPHY 


A crucial question is, What does the three-dimensional structure of a spe- 
cific protein look like? Protein structure determines function, given that the 
specificity of active sites and binding sites depends on the precise three- 
dimensional conformation. Nuclear magnetic resonance spectroscopy and 
x-ray crystallography are two of the most important techniques for eluci- 
dating the conformation of proteins. 

4.5.1 Nuclear Magnetic Resonance Spectroscopy Can Reveal 
the Structures of Proteins in Solution 

Nuclear magnetic resonance (NMR) spectroscopy is unique in being able to 
reveal the atomic structure of macromolecules in solution, provided that 
highly concentrated Solutions (~1 mM, or 15 mg ml~^ for a 15-kd protein) 
can be obtained. This technique depends on the fact that certain atomic nu- 
clei are intrinsically magnetic. Only a limited number of isotopes display 
this property, called spin, and the ones most important to biochemistry are 
listed in Table 4.4. The simplest example is the hydrogen nucleus (^H), 
which is a proton. The spinning of a proton generates a magnetic moment. 
This moment can take either of two orientations, or spin States (called a and 
j8), when an externai magnetic field is applied (Figure 4.43). The energy dif- 
ference between these States is proportional to the strength of the imposed 
magnetic field. The a state has a slightly lower energy and hence is slightly 
more populated (by a factor of the order of 1.00001 in a typical experiment) 
because it is aligned with the field. A spinning proton in an a state can be 


TABLE 4.4 Biologically important 
nuclei giving NMR signals 


Nucleus 

Natural 
abundance 
(% by weight 
of the element) 


99.984 


0.016 

13c 

1.108 


99.635 

15N 

0.365 


0.037 


100.0 


10.05 

31p 

100.0 


75.4 

39 K 

93.1 
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FIGURE 4.43 Basís of NMR 
spectroscopy. The energies of the two 
orientations of a nucleus of spin \ (such 
as ^^P and ^H) depend on the strength of 
the appiied magnetic field. Absorption of 
electromagnetic radiation of appropriate 
frequency induces a transition from the 
lower to the upper levei. 


FIGURE 4.44 One-dimensíonal NMR 
spectra. (A) ^H-NMR spectrum of ethanol 
(CH3CH2OH) shows that the Chemical 
shifts for the hydrogen are clearly resolved. 
(B) ^H-NMR spectrum from a 55 amino 
acid fragment of a protein with a role in 
RNA splicing shows a greater degree of 
complexity. A large number of peaks are 
present and many overlap. [(A) After C. 
Branden and J. Tooze, Introduction to Protein 
Structure (Garland, 1991), p. 280; (B) courtesy 
of Barbara Amann and Wesley McDermott.] 


raised to an excited state (j 8 state) by applying a pulse of electromagnetic 
radiation (a radio-frequency, or RF, pulse), provided the frequency cor- 
responds to the energy difference between the a and the j 8 States. In these 
circumstances, the spin will change from o: to j 8 ; in other words, resonance 
will be obtained. A resonance spectrum for a molecule can be obtained by 
varying the magnetic field at a constant frequency of electromagnetic 
radiation or by keeping the magnetic field constant and varying electro¬ 
magnetic radiation. 

These properties can be used to examine the chemical surroundings of 
the hydrogen nucleus. The flow of electrons around a magnetic nucleus gen- 
erates a small local magnetic field that opposes the appiied field. The de¬ 
gree of such shielding depends on the surrounding electron density. Con- 
sequently, nuclei in different environments will change States, or resonate, 
at slightly different field strengths or radiation frequencies. The nuclei of 
the perturbed sample absorb electromagnetic radiation at a frequency that 
can be measured. The different frequencies, termed chemical shifts, are ex- 
pressed in fractional units S (parts per million, or ppm) relative to the shifts 
of a standard compound, such as a water-soluble derivative of tetra- 
methysilane, that is added with the sample. For example, a -CH 3 proton 
typically exhibits a chemical shift (S) of 1 ppm, compared with a chemical shift 
of 7 ppm for an aromatic proton. The chemical shifts of most protons in 
protein molecules fali between 0 and 9 ppm (Figure 4.44). It is possible 
to resolve most protons in many proteins by using this technique of one- 
dimensional NMR. With this information, we can then deduce changes to 
a particular chemical group under different conditions, such as the con- 
formational change of a protein from a disordered structure to an a helix 
in response to a change in pH. 

We can garner even more information by examining how the spins on 
different protons affect their neighbors. By inducing a transient magneti- 
zation in a sample through the application a radio-frequency pulse, it is pos¬ 
sible to alter the spin on one nucleus and examine the effect on the spin of 
a neighboring nucleus. Especially revealing is a two-dimensional spectrum 
obtained by nuclear Overhauser enhancement spectroscopy (NOESY), which 
graphically displays pairs of protons that are in dose proximity, even if they 
are not dose together in the primary structure. The basis for this technique 
is the nuclear Overhauser effect (NOE), an interaction between nuclei that 
is proportional to the inverse sixth power of the distance between them. 
Magnetization is transferred from an excited nucleus to an unexcited one if 
they are less than about 5 A apart (Figure 4.45A). In other words, the ef¬ 
fect pr o vides a means of detecting the location of atoms relative to one an- 
other in the three-dimensional structure of the protein. The diagonal of a 
NOESY spectrum corresponds to a one-dimensional spectrum. The off- 
diagonal peaks provide crucial new information: they identify pairs of protons 
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FIGURE 4.45 The nuclear Overhauser 
effect. The nuclear Overhauser effect 
(NOE) identifies pairs of protons that 
are in dose proximity. (A) Schematic 
representation of a polypeptide chain 
highlighting five particular protons. 

Protons 2 and 5 are in dose proximity 
(~4 Â apart), whereas other pairs are 
farther apart. (B) A highly simplified NOESY 
spectrum. The diagonal shows five peaks 
corresponding to the five protons in part 
A. The peaks above the diagonal and the 
symmetrically related one below reveal 
that proton 2 is dose to proton 5. 


that are less than 5 À apart (Figure 4.45B). A two-di- 
mensional NOESY spectrum for a protein comprising 55 
amino acids is shown in Figure 4.46. The large number 
of off-diagonal peaks reveals short proton-proton dis- 
tances. The three-dimensional structure of a protein can 
be reconstructed with the use of such proximity relations. 
Structures are calculated such that protons that must be 
separated by less than 5 A on the basis of NOESY spec- 
tra are dose to one another in the three-dimensional 
structure (Figure 4.47). If a sufficient number of distance 
constraints are applied, the three-dimensional structure 
can be determined nearly uniquely. A family of related 
structures is generated for three reasons (Figure 4.48). 
First, not enough constraints may be experimentally 
accessible to fully specify the structure. Second, the dis- 
tances obtained from analysis of the NOESY spectrum 
are only approximate. Finally, the experimental observa- 
tions are made not on single molecules but on a large 
number of molecules in solution that may have slightly 
different structures at any given moment. Thus, the fam¬ 
ily of structures generated from NMR structure analysis 
indicates the range of conformations for the protein in so¬ 
lution. At present, NMR spectroscopy can determine the 
structures of only relatively small proteins (<40 kd), but 
its revolving power is certain to increase. The power of 



FIGURE 4.46 Detecting short proton-proton dístances. 

A NOESY spectrum for a 55 amino acid domain from a protein 
having a role in RNA splicing. Each off-diagonal peak corresponds 
to a short proton-proton separation. This spectrum reveals 
hundreds of such short proton-proton distances, which can be 
used to determine the three-dimensional structure of this 
domain. [Courtesy of Barbara Amann and Wesley McDermott.] 
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FIGURE 4.47 Structures calculated 
on the basis of NMR constraints. 

(A) NOESY observations show that protons 
(connected by dotted red lines) are dose 
to one another in space. (B) A three- 
dimensional structure calculated with 
these proton pairs constrained to be 
dose together. 











FIGURE 4.48 A famíly of structures. A 

set of 25 structures for a 28 amino acid 
domain from a zinc-finger-DNA-binding 
protein. The red line traces the average 
course of the protein backbone. Each of 
these structures is consistent with 
hundreds of constraints derived from NMR 
experiments. The differences between the 
individual structures are due to a 
combination of imperfections in the 
experimental data and the dynamic nature 
of proteins in solution. [Courtesy of Barbara 
Amann.] 
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FIGURE 4.49 Essence of an x-ray 
crystallographic experíment: an x-ray 
beam, a crystal, and a detector. 


NMR has been greatly enhanced by the ability to produce proteins labeled 
uniformly or at specific sites with and with the use of recom- 

binant DNA technology (Chapter 6). 

4.5.2 X-Ray Crystallography Reveais Three-Dimensional 
Structure in Atomic Detail 

X-ray crystallography provides the finest visualization of protein structure 
currently available. This technique can reveal the precise three- dimensional 
positions of most atoms in a protein molecule. The use of x-rays provides the 
best resolution because the wavelength of x-rays is about the same length as 
that of a covalent bond. The three components in an x-ray crystallographic 
analysis are aprotón crystal, a source of x-rays, and a detector (Figure 4.49). 

The technique requires that all molecules be precisely oriented, so the first 
step is to obtain crystals of the protein of interest. Slowly adding ammonium 
sulfate or another salt to a concentrated solution of protein to reduce its sol- 
ubility favors the formation of highly ordered crystals. This is the process of 
salting out discussed in Section 4.1.3. For example, myoglobin crystallizes in 
3 M ammonium sulfate (Figure 4.50). Some proteins crystallize readily, 
whereas others do so only after much effort has been expended in identify- 
ing the right conditions. Crystallization is an art; the best practitioners have 
great perseverance and patience. Increasingly large and complex proteins are 
being crystallized. For example, poliovirus, an 8500-kd assembly of 240 pro¬ 
tein subunits surrounding an RN A core, has been crystallized and its struc¬ 
ture solved by x-ray methods. Crucially, protein crystals frequently display 
their biological activity, indicating that the proteins have crystallized in their 
biologically active configuration. For instance, enzyme crystals may display 
catalytic activity if the crystals are suffused with substrate. 

Next, a source of x-rays is required. A beam of x-rays of wavelength 
1 .54 À is produced by accelerating electrons against a copper target. A nar- 
row beam of x-rays strikes the protein crystal. Part of the beam goes straight 
through the crystal; the rest is scattered in various directions. Finally, these 
scattered, or diffracted, x-rays are detected by x-ray film, the blackening of 
the emulsion being proportional to the intensity of the scattered x-ray beam, 
or by a solid-State electronic detector. The scattering pattern provides abun- 
dant information about protein structure. The basic physical principies 
underlying the technique are: 

1 . Electrons scatter x-rays. The amplitude of the wave scattered by an atom 
is proportional to its number of electrons. Thus, a carbon atom scatters six 
times as strongly as a hydrogen atom does. 

2. The scattered waves recombine. Each atom contributes to each scattered 
beam. The scattered waves reinforce one another at the film or detector if they 
are in phase (in step) there, and they cancel one another if they are out of phase. 

3. The way in which the scattered waves recombine depends only on the atomic 
arrangement. 


FIGURE 4.50 Crystallization of 
myoglobin. 



Myoglobin in Myoglobin in Myoglobin crystals 

dilute buffer 3 M (NH 4 ) 2 S 04 , pH 7 







The protein crystal is mounted and positioned in a precise orientation 
with respect to the x-ray beam and the film. The crystal is rotated so that 
the beam can strike the crystal from many directions. This rotational mo- 
tion results in an x-ray photograph consisting of a regular array of spots 
called reflections. The x-ray photograph shown in Figure 4.51 is a two- 
dimensional section through a three-dimensional array of 25,000 spots. The 
intensity of each spot is measured. These intensities and their positions are 
the basic experimental data of an x-ray crystallographic analysis. The next 
step is to reconstruct an image of the protein from the observed intensities. 
In light microscopy or electron microscopy, the diffracted beams are focused 
by lenses to directly form an image. However, appropriate lenses for focus- 
ing x-rays do not exist. Instead, the image is formed by applying a mathe- 
matical relation called a Fourier transform. For each spot, this operation 
yields a wave of electron density whose amplitude is proportional to the 
square root of the observed intensity of the spot. Each wave also has a 
phase —that is, the timing of its crests and troughs relative to those of other 
waves. The phase of each wave determines whether the wave reinforces or 
cancels the waves contributed by the other spots. These phases can be de- 
duced from the well-understood diffraction patterns produced by electron- 
dense heavy-atom reference markers such as uranium or mercury at specific 
sites in the protein. 

The stage is then set for the calculation of an electron-density map, which 
gives the density of electrons at a large number of regularly spaced points 
in the crystal. This three-dimensional electron-density distribution is rep- 
resented by a series of parallel sections stacked on top of one another. Each 
section is a transparent plastic sheet (or, more recently, a layer in a Com¬ 
puter image) on which the electron-density distribution is represented by 
contour lines (Figure 4.52), like the contour lines used in geological survey 
maps to depict altitude (Figure 4.53). The next step is to interpret the elec¬ 
tron-density map. A criticai factor is the resolution of the x-ray analysis, 
which is determined by the number of scattered intensities used in the 
Fourier synthesis. The fidelity of the image depends on the resolution of the 
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FIGURE 4.51 Myoglobín crystal and 
x-ray. (A) Crystal of myoglobin. (B) X-ray 
precession photograph of a myoglobin 
crystal. [(A) Mel Poilinger/Fran Heyl 
Associates.] 


the iron atom. [From J. C. Kendrew. The three- 
dimensional structure of a protein molecule. 
Copyright © 1961 by Scientific American, Inc. 

All rights reserved.] 



FIGURE 4.53 Section of a U.S. 
Geological Survey map. Capitol Peak 
Quadrangle, Colorado. 






FIGURE 4.54 Resolutíon affects the 
quality of an ímage. The effect of 
resolution on the quality of a reconstructed 
image is shown by an optical analog of 
x-ray diffraction: (A) a photograph of 
the Parthenon; (B) an optical diffraction 
pattern of the Parthenon; (C and D) 
images reconstructed from the pattern 
in part B. More data were used to obtain 
image D than image C, which accounts 
for the higher quality of image D. 

[(A) Courtesy of Dr. Thomas Steitz. (B) Courtesy 
of Dr. David DeRosier).] 



Fourier synthesis, as shown by the optical analogy in Figure 4.54. A reso¬ 
lution of 6 A reveals the course of the polypeptide chain but few other struc- 
tural details. The reason is that polypeptide chains pack together so that their 
centers are between 5 A and 10 A apart. Maps at higher resolution are needed 
to delineate groups of atoms, which lie between 2.8 À and 4.0 A apart, and 
individual atoms, which are between 1.0 A and 1.5 A apart. The ultimate 
resolution of an x-ray analysis is determined by the degree of perfection of 
the crystal. For proteins, this limiting resolution is usually about 2 À. 

The structures of more than 10,000 proteins had been elucidated by 
NMR and x-ray crystallography by mid-2000, and several new structures 
are now determined each day. The coordinates are collected at the Protein 
Data Bank (http://www.rcsb.org/pdb) and the structures can be accessed 
for visualization and analysis. Knowledge of the detailed molecular archi- 
tecture of proteins has been a source of insight into how proteins recognize 
and bind other molecules, how they function as enzymes, how they fold, 
and how they evolved. This extraordinarily rich harvest is continuing at a 
rapid pace and is greatly influencing the entire field of biochemistry. 


SUMMARY 


The rapid progress in gene sequencing has advanced another goal of bio¬ 
chemistry—elucidation of the proteome. The proteome is the complete 
set of proteins expressed and includes information about how they are 
modified, how they function, and how they interact with other molecules. 

o The Purification of Proteins Is an Essential Step 
in Understanding Their Function 

Proteins can be separated from one another and from other molecules 
on the basis of such characteristics as solubility, size, charge, and bind- 
ing affinity. SDS-polyacrylamide gel electrophoresis separates the 
polypeptide chains of proteins under denaturing conditions largely ac- 
cording to mass. Proteins can also be separated electrophoretically on 
the basis of net charge by isoelectric focusing in a pH gradient. Ultra- 
centrifugation and gel-filtration chromatography resolve proteins ac- 
cording to size, whereas ion-exchange chromatography separates them 
mainly on the basis of net charge. The high affinity of many proteins 
for specific chemical groups is exploited in affinity chromatography, in 







which proteins bind to columns containing beads bearing covalently 
linked substrates, inhibitors, or other specifically recognized groups. 
The mass of a protein can be precisely determined by sedimentation 
equilibrium measurements or by mass spectrometry. 

• Amino Acid Sequences Can Be Determined 
by Automated Edman Degradation 

The amino acid composition of a protein can be ascertained by hy- 
drolyzing it into its constituent amino acids in 6 N HCl at 110°C. The 
amino acids can be separated by ion-exchange chromatography and 
quantitated by reacting them with ninhydrin or fluorescamine. Amino 
acid sequences can be determined by Edman degradation, which re¬ 
moves one amino acid at a time from the amino end of a peptide. Phenyl 
isothiocyanate reacts with the terminal amino group to form a phenyl- 
thiocarbamoyl derivative, which cydizes under mildly acidic conditions 
to give a phenylthiohydantoin-amino acid and a peptide shortened by 
one residue. Automated repeated Edman degradations by a sequenator 
can analyze sequences of about 50 residues. Longer polypeptide chains 
are broken into shorter ones for analysis by specifically cleaving them 
with a reagent such as cyanogen bromide, which splits peptide bonds 
on the carboxyl side of methionine residues. Enzymes such as trypsin, 
which cleaves on the carboxyl side of lysine and arginine residues, also 
are very useful in splitting proteins. Amino acid sequences are rich in 
information concerning the kinship of proteins, their evolutionary rela- 
tions, and diseases produced by mutations. Knowledge of a sequence 
provides valuable clues to conformation and function. 

• Immunology Provides Important Techniques with Which 
to Investigate Proteins 

Proteins can be detected and quantitated by highly specific antibodies; 
monoclonal antibodies are especially useful because they are homoge- 
neous. Enzyme-linked immunosorbent assays and Western blots of 
SDS-polyacrylamide gels are used extensively. Proteins can also be 
localized within cells by immunofluorescence microscopy and immu- 
noelectron microscopy. 

• Peptides Can Be Synthesized by Automated Solid-Phase Methods 

Polypeptide chains can be synthesized by automated solid-phase meth¬ 
ods in which the carboxyl end of the growing chain is linked to an insol- 
uble support. The n-carboxyl group of the incoming amino acid is acti- 
vated by dicyclohexylcarbodiimide and joined to the o:-amino group of 
the growing chain. Synthetic peptides can serve as drugs and as antigens 
to stimulate the formation of specific antibodies. They can also be sources 
of insight into relations between amino acid sequence and conformation. 

• Three-Dimensional Protein Structure Can Be Determined 
by NMR Spectroscopy and X-Ray Crystallography 

Nuclear magnetic resonance spectroscopy and x-ray crystallography 
have greatly enriched our understanding of how proteins fold, recog- 
nize other molecules, and catalyze chemical reactions. Nuclear magnetic 
resonance spectroscopy reveals the structure and dynamics of proteins 
in solution. The chemical shift of nuclei depends on their local envi- 
ronment. Furthermore, the spins of neighboring nuclei interact with 
each other in ways that pro vide definitive structural information. 

X-ray crystallography is possible because electrons scatter x-rays; 
the way in which the scattered waves recombine depends only on the 
atomic arrangement. The three- dimensional structures of thousands of 
proteins are now known in atomic detail. 



I 114 I CHAPTER 4 • Exploring Proteins 

-1 KEY TERMS |- 


proteome (p. 78) 
assay (p. 79) 
homogenate (p. 79) 
salting out (p. 80) 
dialysis (p. 80) 

gel-filtration chromatography (p. 81) 
ion-exchange chromatography (p. 81) 
affinity chromatography (p. 82) 
high-pressure liquid chromatography 
(HPLC) (p. 82) 
gel electrophoresis (p. 83) 
isoelectric point (p. 85) 
isoelectric focusing (p. 85) 


two-dimensional electrophoresis (p. 85) 
sedimentation coefficient (Svedberg 
units, S) (p. 88) 

matrix-assisted laser desorption- 
ionization— time of flight spectrometry 

(MALDI-TOF) (p. 89) 
dabsyl chloride (p. 92) 
dansyl chloride (p. 92) 

Edman degradation (p. 92) 
phenyl isothiocy anate (p. 92) 
cyanogen bromide (CNBr) (p. 94) 
overlap peptides (p. 95) 
diagonal electrophoresis (p. 96) 


antibody (p. 98) 
antigen (p. 98) 

antigenic determinant (epitope) (p. 98) 
monoclonal antibodies (p. 100) 
enzyme-linked immunosorbent assay 
(ELISA) (p. 101) 

Western blotting (p. 103) 
fluorescence microscopy (p. 103) 
green fluorescent protein (GEP) (p. 103) 
solid-phase method (p. 107) 
nuclear magnetic resonance (NMR) 
spectroscopy (p. 107) 
x-ray crystallography (p. 110) 
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PROBLEMS 


1. Valuable reagents. The following reagents are often used in 
protein chemistry: 


CNBr 

Urea 

Mercaptoethanol 
Trypsin 
Performic acid 


Dabsyl chloride 

6NHC1 

Ninhydrin 

Phenyl isothiocyanate 
Chymotrypsin 


Which one is the best suited for accomplishing each of the fol¬ 
lowing tasks? 


(a) Determination of the amino acid sequence of a small peptide. 


(b) Identification of the amino-terminal residue of a peptide (of 
which you have less than 0.1 jütg). 

(c) Reversible denaturation of a protein devoid of disulfide 
bonds. Which additional reagent would you need if disulfide 
bonds were present? 


(d) Hydrolysis of peptide bonds on the carboxyl side of aromatic 
residues. 


(e) Cleavage of peptide bonds on the carboxyl side of methionines. 

(f) Hydrolysis of peptide bonds on the carboxyl side of lysine 
and arginine residues. 


2 . Finding an end. Anhydrous hydrazine (H2N—NH2) has been 
used to cleave peptide bonds in proteins. What are the reaction 
Products? How might this technique be used to identify the car¬ 
boxyl-terminal amino acid? 


3. Crafting a new hreakpoint. Ethyleneimine reacts with cysteine 
side chains in proteins to form S-aminoethyl derivatives. The 
peptide bonds on the carboxyl side of these modified cysteine 
residues are susceptible to hydrolysis by trypsin. Why? 

4. Spectrometry. The absorbance A of a solution is defined as 

A = logio (lo/I) 

in which Iq is the incident light intensity and I is the transmit- 
ted light intensity. The absorbance is related to the molar ab- 
sorption coefficient (extinction coefficient) e (in cm~^), 

concentration c (in M), and path length I (in cm) by 

A = ele 

The absorption coefficient of myoglobin at 580 nm is 
15,000 cm~^. What is the absorbance of a 1 mg ml~^ so¬ 


lution across a 1 - cm path? What percentage of the incident light 
is transmitted by this solution? 

5. A slow mover. Tropomyosin, a 93-kd muscle protein, sediments 
more slowly than does hemoglobin (65 kd). Their sedimentation 
coefficients are 2.6S and 4.31S, respectively. Which structural 
feature of tropomyosin aceounts for its slow sedimentation? 

6. Sedimenting spheres. What is the dependence of the sedimenta¬ 
tion coefficient S of a spherical protein on its mass? How much more 
rapidly does an 80-kd protein sediment than does a 40-kd protein? 

7. Size estimate. The relative electrophoretic mobilities of a 
30-kd protein and a 92-kd protein used as standards on an 
SDS-polyacrylamide gel are 0.80 and 0.41, respectively. What is the 
apparent mass of a protein having a mobility of 0.62 on this gel? 

8. A new partnership? The gene encoding a protein with a sin¬ 
gle disulfide bond undergoes a mutation that changes a serine 
residue into a cysteine residue. You want to find out whether the 
disulfide pairing in this mutant is the same as in the original pro¬ 
tein. Propose an experiment to directly answer this question. 

9. Sorting cells. Fluorescence-activated cell sorting (FACS) is a 
powerful technique for separating cells aceording to their con- 
tent of particular molecules. For example, a fluorescence-labeled 
antibody specific for a cell-surface protein can be used to detect 
cells containing such a molecule. Suppose that you want to iso- 
late cells that possess a receptor enabling them to detect bacte- 
rial degradation produets. However, you do not yet have an 
antibody directed against this receptor. Which fluorescence- 
labeled molecule would you prepare to identify such cells? 

10. Column choice. (a) The octapeptide AVGWRVKS was di- 
gested with the enzyme trypsin. Would ion exchange or molec¬ 
ular exclusion be most appropriate for separating the produets? 
Explain. (b) Suppose that the peptide was digested with chy¬ 
motrypsin. What would be the optimal separation technique? 
Explain. 

11 . Making more enzyme? In the course of purifying an enzyme, 
a researcher performs a purification step that results in an in- 
crease in the total activity to a value greater than that present in 
the original crude extract. Explain how the amount of total ac¬ 
tivity might increase. 

12. Protein purification problem. Complete the table at the top 
of the next page. 
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Purification 

procedure 

Total protein 
(mg) 

Total activity 
(units) 

Specific activity 
(units mg~^) 

Purification 

Icvel 

Yield 

(%) 

Crude extract 

20,000 

4,000,000 


1 

100 

(NH) 4 S 04 precipitation 

5,000 

3,000,000 




DEAE-cellulose chromatography 

1,500 

1,000,000 




Size-exclusion chromatography 

500 

750,000 




Affinity chromatography 

45 

675,000 





Chapter Integration Problems 

13. Quaternary structure. A protein was purified to homogeneity. 
Determination of the molecular weight by molecular exclusion 
chromatography yields 60 kd. Chromatography in the presence 
of 6 M urea yields a 30-kd species. When the chromatog¬ 
raphy is repeated in the presence of 6 M urea and 10 mM 
P-mercaptoethanol, a single molecular species of 15 kd results. 
Describe the structure of the molecule. 

14. Helix~coil transitions. 

(a) NMR measurements have shown that poly-L-lysine is a ran- 
dom coil at pH 7 but becomes a helical as the pH is raised above 
10. Account for this pH-dependent conformational transition. 

(b) Predict the pH dependence of the helix-coil transition of 
poly - L - glut amate. 

15. Peptides on a chip. Large numbers of different peptides can 
be synthesized in a small area on a solid support. This high-den- 
sity array can then be probed with a fluorescence-labeled pro¬ 
tein to find out which peptides are recognized. The binding of 
an antibody to an array of 1024 different peptides occupying a 
total area the size of a thumbnail is shown in the figure below. 
How would you synthesize such a peptide array? [Hint: Use light 
instead of acid to deprotect the terminal amino group in each 
round of synthesis.] 



Fluorescente scan of an array of 1024 peptides in a 1.6-cm^ 
area. Each synthesis site is a 400-jann square. A fluorescentiy 
labeled monoclonal antibody was added to the array to identify 
peptides that are recognized. The height and color of each square 
denote the fluorescence intensity. [After S. P. A. Fodor, J. O. Read, M. C. 
Pirrung, L. Stryer, A. T. Lu, and D. Solas. Science 251 (1991 ):767.] 


Data Interpretation Problems 

16. Protein sequencing 1. Determine the sequence of hexapeptide 
based on the following data. Note: When the sequence is not 
known, a comma separates the amino acids. (See Table 4.3) 

Amino acid composition: (2R,A,S,V,Y) 

N-terminal analysis of the hexapeptide: A 
Trypsin digestion: (R,A,V) and (R,S,Y) 

Carboxypeptidase digestion: No digestion. 

Chymotrypsin digestion: (A,R,V,Y) and (R,S) 

17. Protein sequencing 11. Determine the sequence of a peptide 
consisting of 14 amino acids on the basis of the following data. 

Amino acid composition: (4S,2L,F,G,I,M,T,W,Y) 

N-terminal analysis: S 
Carboxypeptidase digestion: L 
Trypsin digestion: (3S,2L,F,I,M,T,W) (G,K,S,Y) 
Chymotrypsin digestion: 

(F,I,S) (G,K,L) (L,S) (M,T) (S,W) (S,Y) 

N-terminal analysis of (F,I,S) peptide: S 
Cyanogen bromide treatment: 

(2S,F,G,I,K,L,M",T,Y) (2S,L,W) 

M , methionine detected as homoserine 

18. Edman degradation. Alanine amide was treated with phenyl 
isothiocy anate to form PTH-alanine. Write a mechanism for 
this reaction. 




DNA, RNA, and the Flow 
of Genetic Information 




Having genes in common accounts for the resemblance of a 
mother and her daughters. Genes must be expressed to exert an 
effect, and proteins regulate such expression. One such regulatory 
protein, a zinc-finger protein (zinc ion is blue, protein is red), is 
shown bound to a control or promoter region of DNA (black). 

[(Left) Barnaby Hall/Photonica.] 


OUTLINE 


DNA and RNA are long linear polymers, called nucleic acids, that carry in- 
formation in a form that can be passed from one generation to the next. 
These macromolecules consist of a large number of linked nucleotides, each 
composed of a sugar, a phosphate, and a base. Sugars linked by phosphates 
form a common backbone, whereas the bases vary among 
four kinds. Genetic information is stored in the sequence of 
bases along a nucleic acid chain. The bases have an addi- 
tional special property: they form specific pairs with one 
another that are stabilized by hydrogen bonds. The base 
pairing results in the formation of a double helix, a helical 
structure consisting of two strands. These base pairs provide 
a mechanism for copying the genetic information in an exist- 
ing nucleic acid chain to form a new chain. Although RNA 
probably functioned as the genetic material very early in 
evolutionary history, the genes of all modern cells and many 
viruses are made of DNA. DNA is replicated by the action 
of DNA polymerase enzymes. These exquisitely specific 
enzymes copy sequences from nucleic acid templates with 
an error rate of less than 1 in 100 million nucleotides. 

Genes specify the kinds of proteins that are made by 
cells, but DNA is not the direct template for protein syn- 
thesis. Rather, the templates for protein synthesis are 
RNA (ribonucleic acid) molecules. In particular, a class of 
RNA molecules called messenger RNA (mRNA) are the 
information-carrying intermediates in protein synthesis. 

Other RNA molecules, such as transfer RNA (tRNA) and 
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HO OH 

Ribose 



Deoxyribose 

FIGURE 5.2 Ribose and deoxyribose. 

Atoms are numbered with primes to 
distinguish them from atoms in bases 
(see Figure 5.4). 


ribosomal RNA (rRNA), are part of the protein-synthesizing machinery. All 
forms of cellular RNA are synthesized by RNA polymerases that take in- 
structions from DNA templates. This process of transcription is followed by 
translation, the synthesis of proteins according to instructions given by 
mRNA templates. Thus, the flow of genetic information, or gene expression, 
in normal cells is: 

_ ^ ^ . Transcription _ ^ ^ . Translation _ 

DNA-> RNA-> Protem 

This flow of information is dependent on the genetic code, which de¬ 
fines the relation between the sequence of bases in DNA (or its mRNA tran- 
script) and the sequence of amino acids in a protein. The code is nearly the 
same in all organisms: a sequence of three bases, called a codon, specifies an 
amino acid. Codons in mRNA are read sequentially by tRNA molecules, 
which serve as adaptors in protein synthesis. Protein synthesis takes place 
on ribosomes, which are complex assemblies of rRNAs and more than 50 
kinds of proteins. 

The last theme to be considered is the interrupted character of most eu- 
karyotic genes, which are mosaics of nucleic acid sequences called introns 
and exons. Both are transcribed, but introns are cut out of newly synthe¬ 
sized RNA molecules, leaving mature RNA molecules with continuous ex¬ 
ons. The existence of introns and exons has crucial implications for the evo- 
lution of proteins. 


5.1 A NUCLEIC ACID CONSISTS OF FOUR KINDS 
OF BASES LINKED TO A SUGAR-PHOSPHATE BACKBONE 


The nucleic acids DNA and RNA are well suited to function as the carriers 
of genetic information by virtue of their covalent structures. These macro- 
molecules are linear polymers built up from similar units connected end to 
end (Figure 5.1). Each monomer unit within the polymer consists of three 
components: a sugar, a phosphate, and a base. The sequence of bases uniquely 
characterizes a nucleic acid and represents a form of linear information. 



Base, 

Base/+i 

Base,+2 


^ Sugar 

Sugar 

Sugar 

Sugar 

Sugar 

_ á 


Phosphate Phosphate Phosphate Phosphate Phosphate 


FIGURE 5.1 Polymeríc structure of nucleic acids. 


5.1.1 RNA and DNA Differ in the Sugar Component 
and One of the Bases 

The sugar in deoxyribonucleic acid (DNA) is deoxyribose. The deoxy prefix 
indicates that the 2' carbon atom of the sugar lacks the oxygen atom that is 
linked to the 2' carbon atom of ribose (the sugar in ribonucleic acid, or RNA), 
as shown in Figure 5.2. The sugars in nucleic acids are linked to one another 
by phosphodiester bridges. Specifically, the 3'-hydroxyl (3'-OH) group of 
the sugar moiety of one nucleotide is esterified to a phosphate group, which 
is, in turn, joined to the 5'-hydroxyl group of the adjacent sugar. The chain 
of sugars linked by phosphodiester bridges is referred to as the backbone of 
the nucleic acid (Figure 5.3). Whereas the backbone is constant in DNA 
and RNA, the bases vary from one monomer to the next. Two of the bases 






base 





RNA 


H 119 I — 

Nucleic Adds 


FIGURE 5.3 Backbones of DNA 
and RNA. The backbones of these 
nucleic acids are formed by 3'-to-5' 
phosphodiester linkages. A sugar unit 
is highlighted in red and a phosphate 
group in blue. 


are derivatives of purine —adenine (A) and guanine (G)— and two of pyrim- 
idine —cytosine (C) and thymine (T, DNA only) or uracil (U, RNA only), 
as shown in Figure 5.4. 

RNA, like DNA, is a long unbranched polymer consisting of nucleotides 
joined by 3'^ 5' phosphodiester bonds (see Figure 5.3). The covalent 
structure of RNA differs from that of DNA in two respects. As stated ear- 
lier and as indicated by its name, the sugar units in RNA are riboses rather 
than deoxyriboses. Ribose contains a 2'-hydroxyl group not present in de- 
oxyribose. As a consequence, in addition to the standard 3'^ 5' linkage, a 
2 '^ 5' linkage is possible for RNA. This later linkage is important in the 
removal of introns and the joining of exons for the formation of mature RNA 
(Section 28.x.x). The other difference, as already mentioned, is that one of 
the four major bases in RNA is uracil (U) instead of thymine (T). 

Note that each phosphodiester bridge has a negative charge. This nega- 
tive charge repels nucleophilic species such as hydroxide ion; consequently, 
phosphodiester linkages are much less susceptible to hydrolytic attack than 
are other esters such as carboxylic acid esters. This resistance is crucial for 
maintaining the integrity of Information stored in nucleic acids. The ab- 
sence of the 2'-hydroxyl group in DNA further increases its resistance to 
hydrolysis. The greater stability of DNA probably accounts for its use rather 
than RNA as the hereditary material in all modern cells and in many viruses. 



FIGURE 5.4 Purínes and pyrimidines. Atoms within bases are numbered without 
primes. Uracil instead of thymine is used in RNA. 
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5.1.2 Nucleotides Are the Monomeric Units of Nucleic Acids 


I nudeotid 


STRUCTURAL INSIGHTS, Nucleic Acids offers a three-dimensional perspective on 
nucleotide structure, base pairing, and other aspects of DNA and RNA structure. 



FIGURE 5.5 j3-Glycosidic linkage in a 
nucleoside. 


A unit consisting of a base bonded to a sugar is referred to as a nucleo side . 
The four nucleoside units in RNA are called adenosine, guanosine, cytidine, 
and uridine, whereas those in DNA are called deoxyadenosine, deoxyguano¬ 
sine, deoxycytidine, and thymidine. In each case, N-9 of a purine or N-1 of a 
pyrimidine is attached to C-1' of the sugar (Figure 5.5). The base lies above 
the plane of sugar when the structure is written in the standard orientation; 
that is, the configuration of the N-glycosidic linkage is j8. A nucleo tide is a 
nucleoside joined to one or more phosphate groups by an ester linkage. The 
most common site of esterification in naturally occurring nucleotides is the 
hydroxyl group attached to C-5' of the sugar. A compound formed by the 
attachment of a phosphate group to the C-5' of a nucleoside sugar is called 
a nucleoside 5'-phosphate or a 5'-nucleotide. For example, ATP is adenosine 
5'-triphosphate. Another nucleotide is deoxyguanosine 3'-monophosphate 
(3'-dGMP; Figure 5.6). This nucleotide differs from ATP in that it con- 
tains guanine rather than adenine, contains deoxyribose rather than ribose 
(indicated by the prefix ''d"), contains one rather than three phosphates, 
and has the phosphate esterified to the hydroxyl group in the 3' rather than 
the 5' position. Nucleotides are the monomers that are linked to form RNA 
and DNA. The four nucleotide units in DNA are called deoxyadenylate, de- 
oxyguanylate, deoxycytidylate, and thymidylate. Note that thymidylate con¬ 
tains deoxyribose; by convention, the prefix deoxy is not added because 
thymine-containing nucleotides are only rarely found in RNA. 



5'-ATP 


3'-dGIVIP 


A C G 



FIGURE 5.7 Structure of a DNA chain. 

The chain has a 5' end, which is usually 
attached to a phosphate, and a 3' end, 
which is usually a free hydroxyl group. 


FIGURE 5.6 Nucleotides adenosine 5'-triphosphate (5'-ATP) and deoxyguanosine 
3'-monophosphate (3'-dGIVIP). 

The abbreviated notations pApCpG or pACG denote a trinucleotide of 
DNA consisting of the building blocks deoxyadenylate monophosphate, de- 
oxycytidylate monophosphate, and deoxyguanylate monophosphate linked 
by a phosphodiester bridge, where ''p" denotes a phosphate group (Figure 
5.7). The 5' end will often have a phosphate attached to the 5'-OH group. 
Note that, like a polypeptide (see Section 3.2), a DNA chain has polarity. 
One end of the chain has a free 5'-OH group (or a 5'-OH group attached 
to a phosphate), whereas the other end has a 3'-OH group, neither of which 
is linked to another nucleotide. By convention, the base sequence is written 
in the 5'-to-3' direction. Thus, the symbol ACG indicates that the unlinked 
5'-OH group is on deoxyadenylate, whereas the unlinked 3'-OH group is 
on deoxyguanylate. Because of this polarity, ACG and GCA correspond to 
different compounds. 











A striking characteristic of naturally occurring DNA molecules is their 
length. A DNA molecule must comprise many nucleotides to carry the ge- 
netic information necessary for even the simplest organisms. For example, 
the DNA of a virus such as polyoma, which can cause câncer in certain or¬ 
ganisms, is 5100 nucleotides in length. We can quantify the information car- 
rying capacity of nucleic acids in the following way. Each position can be 
one of four bases, corresponding to two bits of information (2^ = 4). Thus, 
a chain of 5100 nucleotides corresponds to 2 X 5100 = 10,200 bits, or 1275 
bytes (1 byte = 8 bits). The E. coli genome is a single DNA molecule con- 
sisting of two chains of 4.6 million nucleotides, corresponding to 9.2 mil- 
lion bits, or 1.15 megabytes, of information (Figure 5.8). 

DNA molecules from higher organisms can be much larger. The human 
genome comprises approximately 3 billion nucleotides, divided among 24 
distinct DNA molecules (22 autosomes, x and y sex chromosomes) of dif- 
ferent sizes. One of the largest known DNA molecules is found in the In- 
dian muntjak, an Asiatic deer; its genome is nearly as large as the human 
genome but is distributed on only 3 chromosomes (Figure 5.9). The largest 
of these chromosomes has chains of more than 1 billion nucleotides. If such 
a DNA molecule could be fully extended, it would stretch more than 1 foot 
in length. Some plants contain even larger DNA molecules. 



FIGURE 5.8 Electron micrograph of 
part of the £. coli genome. [Dr. Copal 
Murti/Science Photo Library/Photo 
Researchers.] 



FIGURE 5.9 The Indían muntjak and 
its chromosomes. Cells from a female 
Indian muntjak (right) contain three 
pairs of very large chromosomes (stained 
orange). The cell shown is a hybrid 
containing a pair of human chromosomes 
(stained green) for comparison. [(Left) M. 
Birkhead, OSF/Animals Animais. (Right) J-Y Lee, 
M. Koi, E. J. Stanbridge, M. Oshimura, A. T. 
Kumamoto, and A. P. Feinberg. Nature Genetics 
7(1994):30.] 


5.2 A PAIR OF NUCLEIC ACID CHAINS WITH 
COMPLEIVIENTARY SEQUENCES CAN FORM A 
DOUBLE-HELICAL STRUCTURE 

The covalent structure of nucleic acids accounts for their ability to carry in¬ 
formation in the form of a sequence of bases along a nucleic acid chain. 
Other features of nucleic acid structure facilitate the process of replication — 
that is, the generation of two copies of a nucleic acid from one. These fea¬ 
tures depend on the ability of the bases found in nucleic acids to form spe- 
cific base pairs in such a way that a helical structure consisting of two strands 
is formed. The double-helical structure of DNA facilitates the replication 
of the genetic material (Section 5.2.2). 

5.2.1 The Double Helix Is Stabilized by Hydrogen Bonds 
and Hydrophobic Interactions 

The existence of specific base-pairing interactions was discovered in the 
course of studies directed at determining the three-dimensional structure of 
DNA. Maurice Wilkins and Rosalind Franklin obtained x-ray diffraction 
photographs of fibers of DNA (Figure 5.10). The characteristics of these 
diffraction patterns indicated that DNA was formed of two chains that 
wound in a regular helical structure. From these and other data, James Wat- 
son and Francis Crick inferred a structural model for DNA that accounted 



FIGURE 5.10 X-ray diffraction 
photograph of a hydrated DNA fiber. 

The central cross is diagnostic of a helical 
structure. The strong ares on the meridian 
arise from the stack of nucleotide bases, 
which are 3.4 Â a part. [Courtesy of 
Dr. Maurice Wilkins.] 












FIGURE 5.11 Watson-Críck model of double-helícal DNA. 

One polynucleotide chain is shown in blue and the other in 
red. The purine and pyrimidine bases are shown in lighter 
colors than the sugar-phosphate backbone. (A) Axial view. The 
structure repeats along the helical axis (vertical) at intervais of 
34 Â, which corresponds to 10 nucleotides on each chain. 

(B) Radial view, looking down the helix axis. 


for the diffraction pattern and was also the source of some remarkable in- 
sights into the functional properties of nucleic acids (Figure 5.11). 

The features of the Watson-Crick model of DNA deduced from the dif¬ 
fraction patterns are: 

1 . Two helical polynucleotide chains are coiled around a common axis. The 
chains run in opposite directions. 

2. The sugar-phosphate backbones are on the outside and, therefore, the 
purine and pyrimidine bases lie on the inside of the helix. 

3. The bases are nearly perpendicular to the helix axis, and adjacent bases 
are separated by 3.4 À. The helical structure repeats every 34 À, so there 
are 10 bases (= 34 À per repeat/3.4 À per base) per turn of helix. There is 
a rotation of 36 degrees per base (360 degrees per full turn/10 bases per 
turn). 

4. The diameter of the helix is 20 À. 



FIGURE 5.12 Structures of the base 
pairs proposed by Watson and Críck. 


How is such a regular structure able to accommodate an arbitrary se- 
quence of bases, given the different sizes and shapes of the purines and 
pyrimidines? In attempting to answer this question, Watson and Crick dis- 
covered that guanine can be paired with cytosine and adenine with thymine 
to form base pairs that have essentially the same shape (Figure 5.12). These 
base pairs are held together by specific hydrogen bonds. This base-pairing 
scheme was supported by earlier studies of the base composition of DNA 
from different species. In 1950, Erwin Chargaff reported that the ratios of 
adenine to thymine and of guanine to cytosine were nearly the same in all 
species studied. Note in Table 5.1 that all the adenine:thymine and gua- 
nine:cytosine ratios are dose to 1, whereas the adenine-to-guanine ratio 
varies considerably. The meaning of these equivalences was not evident un- 
til the Watson-Crick model was proposed, when it became clear that they 
represent an essential facet of DNA structure. 

The spacing of approximately 3.4 À between nearly parallel base pairs 
is readily apparent in the DNA diffraction pattern (see Figure 5.10). The 


r 


TABLE 5.1 Base compositions experimentally determined for a variety 
of organisms 


Spedes 

A:T 

G:C 

A:G 

Human being 

1.00 

1.00 

1.56 

Salmon 

1.02 

1.02 

1.43 

Wheat 

1.00 

0.97 

1.22 

Yeast 

1.03 

1.02 

1.67 

Escherichia coli 

1.09 

0.99 

1.05 

Serratia marcescens 

0.95 

0.86 

0.70 










stacking of bases one on top of another contributes to the stability of the 
double helix in two ways (Figure 5.13). First, adjacent base pairs attract one 
another through van der Waals forces (Section 1.3.1). Energies associated 
with van der Waals interactions are quite small, such that typical interac- 
tions contribute from 0.5 to 1.0 kcal moP^ per atom pair. In the double he¬ 
lix, however, a large number of atoms are in van der Waals contact, and the 
net effect, summed over these atom pairs, is substantial. In addition, the 
double helix is stabilized by the hydrophobic effect (Section 1.3.4): base 
stacking, or hydrophobic interactions between the bases, results in the ex- 
posure of the more polar surfaces to the surrounding water. This arrange- 
ment is reminiscent of protein folding, where hydrophobic amino acids are 
interior in the protein and hydrophilic are exterior (Section 3.4). Base stack¬ 
ing in DNA is also favored by the conformations of the relatively rigid five- 
membered rings of the backbone sugars. The sugar rigidity affects both the 
single-stranded and the double-helical forms. 

5.2.2 The Double Helix Facilitates the Accurate Transmission 
of Hereditary Information 

The double-helical model of DNA and the presence of specific base pairs 
immediately suggested how the genetic material might replicate. The se- 
quence of bases of one strand of the double helix precisely determines the 
sequence of the other strand; a guanine base on one strand is always paired 
with a cytosine base on the other strand, and so on. Thus, separation of a 
double helix into its two component chains would yield two single-stranded 
templates onto which new double helices could be constructed, each of 
which would have the same sequence of bases as the parent double helix. 
Consequently, as DNA is replicated, one of the chains of each daughter 
DNA molecule would be newly synthesized, whereas the other would be 
passed unchanged from the parent DNA molecule. This distribution of 
parental atoms is achieved by semiconservative replication. 

Matthew Meselson and Franklin Stahl carried out a criticai test of this 
hypothesis in 1958. They labeled the parent DNA with ^^N, a heavy iso- 
tope of nitrogen, to make it denser than ordinary DNA. The labeled DNA 
was generated by growing E. coli for many generations in a médium that 
contained ^^NH4C1 as the sole nitrogen source. After the incorporation of 
heavy nitrogen was complete, the bactéria were abruptly transferred to a 
médium that contained ^"^N, the ordinary isotope of nitrogen. The question 
asked was: What is the distribution of ^"^N and ^^N in the DNA molecules 
after successive rounds of replication? 

The distribution of ^"^N and ^^N was revealed by the technique of 
density-gradient equilibrium sedimentation. A small amount of DNA was dis- 
solved in a concentrated solution of cesium chloride having a density dose to 
that of the DNA (1.7 g cm~^). This solution was centrifuged until it was nearly 
at equilibrium. The opposing processes of sedimentation and diffusion created 
a gradient in the concentration of cesium chloride across the centrifuge cell. 
The result was a stable density gradient, ranging from 1.66 to 1.76 g cm~^. 
The DNA molecules in this density gradient were driven by centrifugai force 
into the region where the solution s density was equal to their own. The ge- 
nomic DNA yielded a narrow band that was detected by its absorption of ul- 
traviolet light. A mixture of ^"^N DNA and ^^N DNA molecules gave clearly 
separate bands because they differ in density by about 1% (Figure 5.14). 

DNA was extracted from the bactéria at various times after they were 
transferred from a ^^N to a ^"^N médium and centrifuged. Analysis of these 
samples showed that there was a single band of DNA after one generation. 
The density of this band was precisely halfway between the densities of the 
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FIGURE 5.13 Axial víew of DNA. 

Base pairs are stacked nearly one on top 
of another in the double helix. 



FIGURE 5.14 Resolution of DNA 
and '^N DNA by density-gradient 
centrifugation. (A) Ultraviolet absorption 
photograph of a centrifuge cell showing 
the two distinct bands of DNA. 

(B) Densitometric tracing of the absorption 
photograph. [From M. Meselson and F. W. 

Stahl. Proc. Natl. Acod. Sd. U.S.A. 44(1958):671.] 



FIGURE 5.15 Detectíon of semícon- 
servative replícation of £. co/i DNA by 
densíty-gradient centrifugation. The 

position of a band of DNA depends on 
its content of and After 1.0 
generation, all of the DNA molecules were 
hybrids containing equal amounts of ^^N 
and ^^N. [From M. Meselson and F. W. StahI. 
Proc. Natl. Acad. Sei. U.S.A. 44(1958);671.] 



Original parent molecule 



First-generation daughter molecules 



Second-generation daughter molecules 


FIGURE 5.16 Diagram of 
semíconservative replícation. 

Parental DNA is shown in blue and newly 
synthesized DNA in red. [After M. Meselson 
and F. W. StahI. Proc. Natl. Acad. Sei. U.S.A. 
44(1958):671.] 
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DNA and DNA bands (Figure 5.15). The absence of '^N DNA 
indicated that parental DNA was not preserved as an intact unit after repli- 
cation. The absence of ^"^N DNA indicated that all the daughter DNA de- 
rived some of their atoms from the parent DNA. This proportion had to be 
half because the density of the hybrid DNA band was halfway between the 
densities of the ^"^N DNA and ^^N DNA bands. 

After two generations, there were equal amounts of two bands of DNA. 
One was hybrid DNA, and the other was ^"^N DNA. Meselson and Stahl 
concluded from these incisive experiments ‘‘that the nitrogen in a DNA mol¬ 
ecule is divided equally between two physically continuous subunits; that 
following duplication, each daughter molecule receives one of these; and that 
the subunits are conserved through many duplications.’' Their results agreed 
perfectly with the Watson-Crick model for DNA replication (Figure 5.16). 

5.2.3 The Double Helix Can Be Reversibly Melted 

During DNA replication and other processes, the two strands of the dou¬ 
ble helix must be separated from one another, at least in a local region. In 
the laboratory, the double helix can be disrupted by heating a solution of 
DNA. The heating disrupts the hydrogen bonds between base pairs and 
thereby causes the strands to separate. The dissociation of the double helix 
is often called melting because it oceurs relatively abruptly at a certain tem- 
perature. The melting temperature (T^) is defined as the temperature at 
which half the helical structure is lost. Strands may also be separated by 
adding acid or alkali to ionize the nucleotide bases and disrupt base pairing. 

Stacked bases in nucleic acids absorb less ultraviolet light than do un- 
stacked bases, an effect called hypochromism. Thus, the melting of nucleic 
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FIGURE 5.17 Hypochromísm. 

(A) Single-stranded DNA absorbs light 
more effectively than does double-helical 
DNA. (B) The absorbance of a DNA 
solution at a wavelength of 260 nm 
increases when the double helix is melted 
into single strands. 


acids is easily followed by monitoring their absorption of light, which peaks 
at a wavelength of 260 nm (Figure 5.17). 

Separated complementary strands of nucleic acids spontaneously reas- 
sociate to form a double helix when the temperature is lowered below T^. 
This renaturation process is sometimes called annealing. The facility with 
which double helices can be melted and then reassociated is crucial for the 
biological functions of nucleic acids. Of course, inside cells, the double he- 
lix is not melted by the addition of heat. Instead, proteins called helicases 
use Chemical energy (from ATP) to disrupt the structure of double-stranded 
nucleic acid molecules. 

The ability to melt and reanneal DNA reversibly in the laboratory pro- 
vides a powerful tool for investigating sequence similarity as well as gene 
structure and expression. For instance, DNA molecules from two different 
organisms can be melted and allowed to reanneal or hybridize in the pres- 
ence of each other. If the sequences are similar, hybrid DNA duplexes, with 
DNA from each organism contributing a strand of the double helix, can 
form. Indeed, the degree of hybridization is an indication of the relatedness 
of the genomes and hence the organisms. Similar hybridization experiments 
with RN A and DNA can locate genes in a celhs DNA that correspond to a 
particular RN A. We will return to this important technique in Chapter 6. 

5.2.4 Some DNA Molecules Are Circular and Supercoiled 

The DNA molecules in human chromosomes are linear. However, electron 
microscopic and other studies have shown that intact DNA molecules from 
some other organisms are circular (Figure 5.ISA). The term circular refers 
to the continuity of the DNA chains, not to their geometrical form. DNA 
molecules inside cells necessarily have a very compact shape. Note that the 
E. coli chromosome, fully extended, would be about 1000 times as long as 
the greatest diameter of the bacterium. 

A new property appears in the conversion of a linear DNA molecule into 
a closed circular molecule. The axis of the double helix can itself be twisted 
into a superhelix (Figure 5.18B). A circular DNA molecule without any su- 
perhelical turns is known as a relaxed molecule. Supercoiling is biologically 
important for two reasons. First, a supercoiled DNA molecule has a more 
compact shape than does its relaxed counterpart. Second, supercoiling may hin- 
der or favor the capacity of the double helix to unwind and thereby affects the 
interactions between DNA and other molecules. These topological features of 
DNA will be considered further in Section 27.x.x. 


(A) 

1 jum 




FIGURE 5.18 Electron micrographs 
of circular DNA from mitochondría. 

(A) Relaxed form. (B) Supercoiled form. 
[Courtesy of Dr. David Clayton.] 
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5.2.5 Single-Stranded Nucleic Acids Can Adopt 
Elaborate Structures 

Single-stranded nucleic acids often fold back on themselves to form well-de- 
fined structures. Early in evolutionary history, nucleic acids, particularly 
RNA, may have adopted complex and diverse structures both to store genetic 
information and to catalyze its transmission (Section 2.2.2). Such structures 
are also important in all modern organisms in entities such as the ribosome, 
a large complex of RN As and proteins on which proteins are synthesized. 


5'-T AAATTGGTATGCGAATACCAATAG G-3' 5'-UUGGUGGAGUCUGCAACUGACUCCAUUGC A-3' 


FIGURE 5.19 Stem-loop structures. 

Stem-loop structures may be formed from 
single-stranded DNA and RNA molecules. 
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The simplest and most common structural motif formed is a stem-loop, 
created when two complementary sequences within a single strand come to- 
gether to form double-helical structures (Figure 5.19). In many cases, these 
double helices are made up entirely of Watson-Crick base pairs. In other 
cases, however, the structures include mismatched or unmatched (bulged) 
bases. Such mismatches destabilize the local structure but introduce devi- 
ations from the standard double-helical structure that can be important for 
higher-order folding and for function (Figure 5.20). 
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FIGURE 5.20 Complex structure of an RNA molecule. A single-stranded RNA molecule 
may fold back on itself to form a complex structure. (A) The nucleotide sequence showing 
Watson-Crick base pairs and other nonstandard base pairings in stem-loop structures. 

(B) The three-dimensional structure and one important long-range interaction between 
three bases. Hydrogen bonds within the Watson-Crick base pair are shown as dashed black 
lines; additional hydrogen bonds are shown as dashed green lines. 
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simple stem-loops through the interaction of more widely separated bases. DNA Replication 

Often, three or more bases may interact to stabilize these structures. In such 
cases, hydrogen-bond donors and acceptors that do not participate in 
Watson-Crick base pairs may participate in hydrogen bonds of nonstandard 
pairings. Metal ions such as magnesium ion (Mg^^) often assist in the sta- 
bilization of these more elaborate structures. 


5.3 DNA IS REPLICATED BY POLYIVIERASES THAT 
TAKE INSTRUCTIONS FROM TEMPLATES 

We now turn to the molecular mechanism of DNA replication. The full 
replication machinery in cells comprises more than 20 proteins engaged in 
intricate and coordinated interplay. In 1958, Arthur Kornberg and his col- 
leagues isolated the first known of the enzymes, called DNA polymerases, 
that promote the formation of the bonds joining units of the DNA 
backbone. 

5.3.1 DNA Polymerase Catalyzes 
Phosphodiester-Bond Formation 

DNA polymerases catalyze the step-by-step addition of deoxyribonucleotide 
units to a DNA chain (Figure 5.21). Importantly, the new DNA chain is as- 
sembled directly on a preexisting DNA template. The reaction catalyzed, in 
its simplest form, is: 

(DNA), + dNTP (DNA),+i + PPi 

where dNTP stands for any deoxyribonucleotide and PPi is a pyrophosphate 
molecule. The template can be a single strand of DNA or a double strand with 
one of the chains broken at one or more sites. If single stranded, the template 
DNA must be bound to aprímer strand having a free 3'-hydroxyl group. The 
reaction also requires all four activated precursors—^that is, the deoxynucleo- 
side 5'-triphosphates dATP, dGTP, TTP, and dCTP—as well as Mg^^ ion. 


3 ^/ 

X 


dATP 

PPi 


X 

X 

X" 

dGTP 

PPi 


x®^ 

X 

X 


( 

] ( 


V 


c 

J ( 

: A 

V 


( 

] ( 

: A G 

( 

: c 

j T ( 

: A 


( 

: ( 

] 1 

r c 

: A 


( 

: ( 

] 1 

r ( 

: A 

3/ 

X 




3/ 

X 

X 

X 

xb’’ 


3/ 

X 

X 

X 

X/ 


FIGURE 5.21 Polymerízatíon reaction catalyzed by DNA polymerases. 


The chain-elongation reaction catalyzed by DNA polymerases is a nu- 
cleophilic attack by the 3'-hydroxyl group of the primer on the innermost 
phosphorus atom of the deoxynucleoside triphosphate (Figure 5.22). A 
phosphodiester bridge forms with the concomitant release of pyrophos¬ 
phate. The subsequent hydrolysis of pyrophosphate by pyrophosphatase, a 
ubiquitous enzyme, helps drive the polymerization forward. Elongation of 
the DNA chain proceeds in the 5'-to-3' direction. 

DNA polymerases catalyze the formation of a phosphodiester bond ef- 
ficiently only if the base on the incoming nucleoside triphosphate is com- 
plementary to the base on the template strand. Thus, DNA polymerase is 

































a template-directed enzyme that synthesizes a product with a base sequence 
complementary to that of the template. Many DNA polymerases also have 
a separate nuclease activity that allows them to correct mistakes in DNA by 
using a different reaction to remove mismatched nucleotides. These prop- 
erties of DNA polymerases contribute to the remarkably high fidelity of 
DNA replication, which has an error rate of less than per base pair. 

5.3.2 The Genes of Some Viruses Are Made of RNA 

Genes in all cellular organisms are made of DNA. The same is true for some 
viruses, but for others the genetic material is RNA. Viruses are genetic el- 
ements enclosed in protein coats that can move from one cell to another but 
are not capable of independent growth. One well-studied example of an 
RNA virus is the tobacco mosaic virus, which infects the leaves of tobacco 
plants. This virus consists of a single strand of RNA (6930 nucleotides) sur- 
rounded by a protein coat of 2130 identical subunits. An RNA-directed 
RNA polymerase catalyzes the replication of this virai RNA. 

Another important class of RNA virus comprises the retroviruses, so 
called because the genetic information flows from RNA to DNA rather than 
from DNA to RNA. This class includes human immunodeficiency virus 1 
(HIV-1), the cause of AIDS, as well as a number of RNA viruses that pro- 
duce tumor s in susceptible animais. Retro virus particles contain two copies 
of a single-stranded RNA molecule. On entering the cell, the RNA is copied 
into DNA through the action of a virai enzyme called reverse transcriptase 
(Figure 5.23). The resulting double-helical DNA version of the virai 


FIGURE 5.23 Flow of Information from 
RNA to DNA in retroviruses. The RNA 

genome of a retrovirus is converted into 
DNA by reverse transcriptase, an enzyme 
brought into the cell by the infecting virus 
particle. Reverse transcriptase catalyzes 
the synthesis of a complementary DNA 
strand, the digestion of the RNA, and the 
subsequent synthesis of the DNA strand. 



Virai RNA 


DNA-RNA DNA transcript Double-helical 

hybrid of virai RNA virai DNA 
















genome can become incorporated into the chromosomal DNA of the host I 129 | 

and is replicated along with the normal cellular DNA. At a later time, the Gene Expression 

integrated virai genome is expressed to form virai RN A and virai proteins, 
which assemble into new virus particles. 

Note that RN A viruses are not vestiges of the RN A world. Instead, 
fragments of RNA in these viruses have evolved to encode their protein 
coats and other structures needed for transferring from cell to cell and 
replicating. 


5.4 GENE EXPRESSION IS THE TRANSFORMATION 
OF DNA INFORMATION INTO FUNCTIONAL MOLECULES 

The information stored as DNA becomes useful when it is expressed in the 
production of RNA and proteins. This rich and complex topic is the sub- 
ject of several chapters later in this book, but here we introduce the basics 
of gene expression. DNA can be thought of as archival information, stored 
and manipulated judiciously to minimize damage (mutations). It is ex¬ 
pressed in two steps. First, an RNA copy is made. An RNA molecule that 
encodes proteins can be thought of as a photocopy of the original informa¬ 
tion—it can be made in multiple copies, used, and then disposed of. Sec- 
ond, an RNA molecule can be further thought of as encoding directions for 
protein synthesis that must be translated to be of use. The information in 
messenger RNA is translated into a functional protein. Other types of RNA 
molecules exist to facilitate this translation. We now examine the tran- 
scription of DNA information into RNA, the translation of RNA informa¬ 
tion into protein, and the genetic code that links nucleotide sequence with 
amino acid sequence. 


5.4.1 Several Kinds of RNA Play Key Roles in Gene Expression 

Cells contain several kinds of RNA (Table 5.2). 

1. Messenger RNA is the template for protein synthesis or translation. An 
mRNA molecule may be produced for each gene or group of genes that is to 
be expressed in E. coli, whereas a distinct mRNA is produced for each gene 
in eukaryotes. Consequently, mRNA is a heterogeneous class of molecules. In 
E. coli, the average length of an mRNA molecule is about 1.2 kilobases (kb). 

2. Transfer RNA carries amino acids in an activated form to the ribosome 
for peptide-bond formation, in a sequence dictated by the mRNA template. 
There is at least one kind of tRNA for each of the 20 amino acids. Transfer 
RNA consists of about 7 5 nucleotides (having a mass of about 25 kd), which 
makes it one of the smallest of the RNA molecules. 


Kilobase (kb)— 

A unit of length equal to 1000 base 
pairs of a double-stranded nucleic acid 
molecule (or 1000 bases of a single- 
stranded molecule). 

One kilobase of double-stranded 
DNA has a contour length of 0.34 jLtm 
and a mass of about 660 kd. 


I- 

TABLE 5.2 RNA molecules in E. coli 


Type 

Relative 

amount 

(%) 

Sedimentation 

coefficient 

(S) 

Mass 

(kd) 

Number of 
nucleotides 

Ribosomal RNA (rRNA) 

80 

23 

1.2X10^ 

3700 



16 

0.55 X 10^ 

1700 



5 

3.6 X 10^ 

120 

Transfer RNA (tRNA) 

15 

4 

2.5 X 10^ 

75 

Messenger RNA (mRNA) 

5 


Heterogeneous 















3. Ribosomal RNA (rRNA), the major component of ribosomes, plays both 
a catalytic and a structural role in protein synthesis (Section 29.x.x). In E. 
coli, there are three kinds of rRNA, called 23S, 16S, and 5S RNA because 
of their sedimentation behavior. One molecule of each of these species of 
rRNA is present in each ribosome. 

Ribosomal RNA is the most abundant of the three types of RNA. Trans- 
fer RNA comes next, followed by messenger RNA, which constitutes only 
5% of the total RNA. Eukaryotic cells contain additional small RNA mol- 
ecules. Small nuclear RNA (snRNA) molecules, for example, participate in 
the splicing of RNA exons. A small RNA molecule in the cytosol plays a 
role in the targeting of newly synthesized proteins to in- 
tracellular compartments and extracellular destinations. 

5.4.2 All Cellular RNA Is Synthesized by RNA 
Polymerases 

The synthesis of RNA from a DNA template is called 
transcription and is catalyzed by the enzyme RNA poly- 
merase (Figure 5.24). RNA polymerase requires the fol- 
lowing components: 

1. A template. The preferred template is double-stranded 
DNA. Single-stranded DNA also can serve as a template. 
RNA, whether single or double stranded, is not an ef- 
fective template; nor are RNA-DNA hybrids. 

2. Activated precursors. All four ribonucleoside triphos- 
phates —ATP, GTP, UTP, and CTP—are required. 

3. A divalent metal ion. Mg^^ or Mn^^ are effective. 

RNA polymerase catalyzes the initiation and elongation of RNA chains. 
The reaction catalyzed by this enzyme is: 

(RNA)„ residues + ribonucleoside triphosphate (RNA)^+i residues + PPi 

The synthesis of RNA is like that of DNA in several respects (Figure 5.25). 
First, the direction of synthesis is 5' ^ 3'. Second, the mechanism of elon¬ 
gation is similar: the 3'-OH group at the terminus of the growing chain 



FIGURE 5.24 RNA polymerase. A large enzyme 
comprising many subunits including jS (red) and jS' (yellow), which 
form a "claw" that holds the DNA to be transcribed. The active 
site includes a Mg^^ ion (red bali) at the center of the structure. 
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makes a nucleophilic attack on the innermost phosphate of the incoming | 131 | 

nucleoside triphosphate. Third, the synthesis is driven forward by the hy- Gene Expression 

drolysis of pyrophosphate. In contrast with DNA polymerase, however, 

RNA polymerase does not require a primer. In addition, RNA polymerase 
lacks the nuclease capability used by DNA polymerase to excise mis- 
matched nucleotides. 

All three types of cellular RNA—mRNA, tRNA, and rRNA—are syn- 
thesized in E. coli by the same RNA polymerase according to instructions 
given by a DNA template. In mammalian cells, there is a division of labor 
among several different kinds of RNA polymerases. We shall return to these 
RNA polymerases in Chapter 28. 


DNA template 
(plus strand of (/)X174) 


5.4.3 RNA Polymerases Take Instructions from 
DNA Templates 

RNA polymerase, like the DNA polymerases described 
earlier, takes instructions from a DNA template. The 
earliest evidence was the finding that the base composi- 
tion of newly synthesized RNA is the complement of 
that of the DNA template strand, as exemplified by the 
RNA synthesized from a template of single-stranded 
(/)X174 DNA (Table 5.3). Hybridization experiments 
also revealed that RNA synthesized by RNA poly¬ 
merase is complementary to its DNA template. In these 
experiments, DNA is melted and allowed to reassociate in the presence of 
mRNA. RNA-DNA hybrids will form if the RNA and DNA have com¬ 
plementary sequences. The strongest evidence for the fidelity of transcrip- 
tion carne from base-sequence studies showing that the RNA sequence is 
the precise complement of the DNA template sequence (Figure 5.26). 


TABLE 5.3 Base composition (percentage) of RNA 
synthesized from a virai DNA template 


RNA product 


A 

25 

25 

u 

T 

33 

32 

A 

G 

24 

23 

C 

C 

18 

20 

G 


5 '— GCGGCGACGCGCAGUUAAUCCCACAGCCGCCAGUUCCGCUGGCGGCAUUUU— 3 ' mRNA 

3 '— CGCCGCTGCGCGTCAATTAGGGTGTCGGCGGTCAAGGCGACCGCCGTAAAA— 5 Template strand of DNA 

5 '— GCGGCGACGCGCAGTTAATCCCACAGCCGCCAGTTCCGCTGGCGGCATTTT— 3 ' Coding strand of DNA 

FIGURE 5.26 Complementarity between mRNA and DNA. The base sequence of 
mRNA (red) is the complement of that of the DNA template strand (blue). The sequence 
shown here is from the tryptophan operon, a segment of DNA containing the genes for five 
enzymes that catalyze the synthesis of tryptophan. The other strand of DNA (black) is called 
the coding strand because it has the same sequence as the RNA transcript except for 
thymine (T) in place of uracil (U). 


5.4.4 Transcription Begins near Promoter Sites and 
Ends at Terminator Sites 

RNA polymerase must detect and transcribe discrete genes from within 
large stretches of DNA. What marks the beginning of a transcriptional unit? 
DNA templates contain regions called promoter sites that specifically bind 
RNA polymerase and determine where transcription begins. In bactéria, 
two sequences on the 5' (upstream) side of the first nucleotide to be tran- 
scribed function as promoter sites (Figure 5.27A). One of them, called the 
Pribnow box, has the consensus sequence TATAAT and is centered at —10 
(10 nucleotides on the 5' side of the first nucleotide transcribed, which is 
denoted by + 1). The other, called the —35 region, has the consensus se¬ 
quence TTGACA. The first nucleotide transcribed is usually a purine. 

Eukaryotic genes encoding proteins have promoter sites with a 
TATAAA consensus sequence, called a TATA box or a Hogness box, cen¬ 
tered at about —25 (Figure 5.27B). Many eukaryotic promoters also have a 


I- 

Consensus sequence— 

The base sequences of promoter sites 
are not all identical. However, they do 
possess common features, which can 
be represented by an idealized consen¬ 
sus sequence. Each base in the con¬ 
sensus sequence TATAAT is found in a 
majority of prokaryotic promoters. 

Nearly all promoter sequences differ 
from this consensus sequence at only 
one or two bases. 













FIGURE 5.27 Promoter sites for 
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transcription. Promoter sites are required 
for the initiation of transcription in both 

DNA template 
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(A) prokaryotes and (B) eukaryotes. 

Consensus sequences are shown. The first 
nucleotide to be transcribed is numbered 
+ 1. The adjacent nucleotide on the 5' side 
is numbered -1. The sequences shown are 
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FIGURE 5.28 Base sequence of the 5' 
end of an mRNA transcrípt ín £. co/i. 

A stable hairpin structure is followed by a 
sequence of uridine (U) residues. 


CAAT box with a GGNCAATCT consensus sequence centered at about 
— 75. Transcription of eukaryotic genes is further stimulated by enhancer se- 
quences, which can be quite distant (as many as several kilobases) from the 
start site, on either its 5' or its 3' side. 

RNA polymerase proceeds along the DNA template, transcribing one 
of its strands until it reaches a terminator sequence. This sequence encodes 
a termination signal, which in E. coli is a base-paired hairpin on the newly 
synthesized RNA molecule (Figure 5.28). This hairpin is formed by base 
pairing of self-complementary sequences that are rich in G and C. Nascent 
RNA spontaneously dissociates from RNA polymerase when this hairpin 
is followed by a string of U residues. Alternatively, RNA synthesis can be 
terminated by the action of rho, a protein. Less is known about the termi¬ 
nation of transcription in eukaryotes. A more detailed discussion of the ini- 
tiation and termination of transcription will be given in Chapter 28. The 
important point now is that discrete start and stop signals for transcription 
are encoded in the DNA template. 

In eukaryotes, the mRNA is modified after transcription (Figure 5.29). 
A ‘‘cap'' structure is attached to the 5' end, and a sequence of adenylates, 
the poly(A) tail, is added to the 3' end. These modifications will be pre- 
sented in detail in Section 28.x.x. 


FIGURE 5.29 Modífkation of mRNA. 

Messenger RNA in eukaryotes is modified 
after transcription. A nucleotide "cap" 
structure is added to the 5' end, and a 
poly(A) tail is added at the 3' end. 
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5.4.5 Transfer RNA Is the Adaptor Molecule 
in Protein Synthesis 

We have seen that mRNA is the template for protein synthesis. How then 
does it direct amino acids to become joined in the correct sequence to form 
a protein? In 1958, Francis Crick wrote: 

RNA presents mainly a sequence of sites where hydrogen bonding could 
occur. One would expect, therefore, that whatever went onto the template 
in a specific way did so by forming hydrogen bonds. It is therefore a nat¬ 
ural hypothesis that the amino acid is carried to the template by an adap¬ 
tor molecule, and that the adaptor is the part that actually fits onto the 
RNA. In its simplest form, one would require twenty adaptors, one for 
each amino acid. 

This highly innovative hypothesis soon became established as fact. The 
adaptor in protein synthesis is transfer RNA. The structure and reactions of 









these remarkable molecules will be considered in detail in Chapter 29. 
For the moment, it suffices to note that tRNA contains an amino acid- 
attachment site and a template-recognition site. A tRNA molecule carries a 
specific amino acid in an activated form to the site of protein synthesis. The 
carboxyl group of this amino acid is esterified to the 3' - or 2' -hydroxyl group 
of the ribose unit at the 3' end of the tRNA chain (Figure 5.30). The join- 
ing of an amino acid to a tRNA molecule to form an aminoacyl-tRNA is cat- 
alyzed by a specific enzyme called an aminoacyl-tRNA synthetase. This es- 
terification reaction is driven by ATP cleavage. There is at least one specific 
synthetase for each of the 20 amino acids. The template-recognition site on 
tRNA is a sequence of three bases called an anticodon (Figure 5.31). The 
anticodon on tRNA recognizes a complementary sequence of three bases, 
called a codon, on mRNA. 


5.5 AMINO ACIDS ARE ENCODED BY GROUPS OF THREE 
BASES STARTING FROM A FIXED POINT 

The genetic code is the relation between the sequence of bases in DNA (or 
its RN A transcripts) and the sequence of amino acids in proteins. Experi- 
ments by Francis Crick, Sydney Brenner, and others established the fol- 
lowing features of the genetic code by 1961: 

1. Three nucleotides encode an amino acid. Proteins are built from a basic set 
of 20 amino acids, but there are only four bases. Simple calculations show 
that a minimum of three bases is required to encode at least 20 amino acids. 
Genetic experiments showed that an amino acid is in fact encoded by a group 
of three bases, or codon. 

2. The code is nonoverlapping. Consider a base sequence ABCDEF. In an 
overlapping code, ABC specifies the first amino acid, BCD the next, CDE 
the next, and so on. In a nonoverlapping code, ABC designates the first 
amino acid, EFG the second, and so forth. Genetics experiments again es¬ 
tablished the code to be nonoverlapping. 

I ABC I I DEF I I CHI , , JKL , Base sequence 


— aai — aa2—aas—aa4— Amino acid sequence 

3. The code has no punctuation. In principie, one base (denoted as Q) might 
serve as a ‘‘comma’' between groups of three bases. 

. . . QABCQDEFQGHIQJKLQ. . . 

This is not the case. Rather, the sequence of bases is read sequentially from a 
fixed starting point, without punctuation. 

Start 

1 

ABC j DEF j GHI j JKL j MNO 
aa] — aa2—3a3—aa4—aa5 

4. The genetic code is degenerate. Some amino acids are encoded by more 
than one codon, inasmuch as there are 64 possible base triplets and only 20 
amino acids. In fact, 61 of the 64 possible triplets specify particular amino 
acids and 3 triplets (called stop codons) designate the termination of trans- 
lation. Thus, for most amino acids, there is more than one code word. 



FIGURE 5.30 Attachment of an 
amino acid to a tRNA molecule. 

The amino acid (shown in blue) is 
esterified to the 3'-hydroxyl group of 
the terminal adenosine of tRNA. 
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FIGURE 5.31 Symbolic diagram of 
an aminoacyl-tRNA. The amino acid is 
attached at the 3' end of the RNA. The 
anticodon is the template-recognition site. 
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5.5.1 Major Features of the Genetic Code 

All 64 codons have been deciphered (Table 5.4). Because the code is highly 
degenerate, only tryptophan and methionine are encoded by just one triplet 
each. The other 18 amino acids are each encoded by two or more. Indeed, 
leucine, arginine, and serine are specified by six codons each. The number 
of codons for a particular amino acid correlates with its frequency of oc- 
currence in proteins. 

Codons that specify the same amino acid are called synonyms. For ex- 
ample, CAU and CAC are synonyms for histidine. Note that synonyms are 
not distributed haphazardly throughout the genetic code (depicted in Table 
5.4). An amino acid specified by two or more synonyms occupies a single 
box (unless it is specified by more than four synonyms). The amino acids 
in a box are specified by codons that have the same first two bases but dif- 
fer in the third base, as exemplified by GUU, GUC, GUA, and GUG. 
Thus, most synonyms differ only in the last base of the triplet. Inspection of 
the code shows that XYC and XYU always encode the same amino acid, 
whereas XYG and XYA usually encode the same amino acid. The struc- 
tural basis for these equivalences of codons will become evident when we 
consider the nature of the anticodons of tRNA molecules (Section 29.x.x). 

What is the biological significance of the extensive degeneracy of the ge¬ 
netic code? If the code were not degenerate, 20 codons would designate 
amino acids and 44 would lead to chain termination. The probability of mu- 
tating to chain termination would therefore be much higher with a nonde- 
generate code. Chain-termination mutations usually lead to inactive pro¬ 
teins, whereas substitutions of one amino acid for another are usually rather 


TABLE 5.4 The genetic code 


First position 
(5' end) 

Second position 

Third position 
(3' end) 


u 

C 

A 

G 



Phe 

Ser 

Tyr 

Cys 

U 

u 

Phe 

Ser 

Tyr 

Cys 

c 


Leu 

Ser 

Stop 

Stop 

A 


Leu 

Ser 

Stop 

Trp 

G 


Leu 

Pro 

His 

Arg 

U 

c 

Leu 

Pro 

His 

Arg 

c 


Leu 

Pro 

Gin 

Arg 

A 


Leu 

Pro 

Gin 

Arg 

G 


Ile 

Thr 

Asn 

Ser 

U 

A 

Ile 

Thr 

Asn 

Ser 

c 


Ile 

Thr 

Lys 

Arg 

A 


Met 

Thr 

Lys 

Arg 

G 


Vai 

Ala 

Asp 

Gly 

U 

G 

Vai 

Ala 

Asp 

Gly 

c 


Vai 

Ala 

Glu 

Gly 

A 


Vai 

Ala 

Glu 

Gly 

G 


Note: This table identifies the amino acid encoded by each triplet. For example, the codon 
5' AUG 3' on mRNA specifies methionine, whereas CAU specifies histidine. UAA, 
UAG, and UGA are termination signals. AUG is part of the initiation signal, in addition 
to coding for internai methionine residues. 


















harmless. Thus, degeneracy minimizes the deleterious effects of mutations. De- I 135 | 

generacy of the code may also be significant in permitting DNA base com- The Genetic Code 

position to vary over a wide range without altering the amino acid sequence 

of the proteins encoded by the DNA. The G + C content of bacterial DNA 

ranges from less than 30% to more than 70%. DNA molecules with quite 

different G + C contents could encode the same proteins if different syn- 

onyms of the genetic code were consistently used. 


5.5.2 Messenger RNA Contains Start and Stop Signals 
for Protein Synthesis 

Messenger RNA is translated into proteins on ribosomes, large molecular 
complexes assembled from proteins and ribosomal RNA. How is mRNA in- 
terpreted by the translation apparatus? As already mentioned, UAA, UAG, 
and UGA designate chain termination. These codons are read not by tRNA 
molecules but rather by specific proteins called release factors (Section 
29.x.x). Binding of the release factors to the ribosomes releases the newly 
synthesized protein. The start signal for protein synthesis is more complex. 
Polypeptide chains in bactéria start with a modified amino acid—namely, 
formylmethionine (fMet). A specific tRNA, the initiator tRNA, carries 
fMet. This fMet-tRNA recognizes the codon AUG or, less frequently, 
GUG. However, AUG is also the codon for an internai methionine residue, 
and GUG is the codon for an internai valine residue. Hence, the signal for 
the first amino acid in a prokaryotic polypeptide chain must be more com¬ 
plex than that for all subsequent ones. AUG (or GUG) is only part of the ini- 
tiation signal (Figure 5.32). In bactéria, the initiating AUG (or GUG) codon 
is preceded several nucleotides away by a purine-rich sequence that base- 
pairs with a complementary sequence in a ribosomal RNA molecule (Sec¬ 
tion 29.x.x). In eukaryotes, the AUG closest to the 5' end of an mRNA mol¬ 
ecule is usually the start signal for protein synthesis. This particular AUG is 
read by an initiator tRNA conjugated to methionine. Once the initiator AUG 
is located, the readingframe is established —groups of three nonoverlapping 
nucleotides are defined, beginning with the initiator AUG codon. 
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FIGURE 5.32 Initiation of protein 
synthesis. Start signals are required 
for the initiation of protein synthesis in 
(A) prokaryotes and (B) eukaryotes. 


5.5.3 The Genetic Code Is Nearly Universal 

r is the genetic code the same in all organisms? The base sequences 
of many wild-type and mutant genes are known, as are the amino acid 
sequences of their encoded proteins. In each case, the nucleotide change in 
the gene and the amino acid change in the protein are as predicted by the 
genetic code. Furthermore, mRNAs can be correctly translated by the pro¬ 
tein-synthesizing machinery of very different species. For example, human 
hemoglobin mRNA is correctly translated by a wheat germ extract, and bac¬ 
téria efficiently express recombinant DNA molecules encoding human 
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TABLE 5.5 Distinctive codons of human mitochondria 


Codon 

Standard 

code 

Mitochondrial 

code 

UGA 

Stop 

Trp 

UGG 

Trp 

Trp 

AUA 

Ile 

Met 

AUG 

Met 

Met 

AGA 

Arg 

Stop 

AGG 

Arg 

Stop 


proteins such as insulin. These experimental findings strongly suggested 
that the genetic code is universal. 

A surprise was encountered when the sequence of human mitochon- 
drial DNA became known. Human mitochondria read UGA as a codon for 
tryptophan rather than as a stop signal (Table 5.5). Furthermore, AGA and 
AGG are read as stop signals rather than as codons for arginine, and AUA 
is read as a codon for methionine instead of isoleucine. Mitochondria of 
other species, such as those of yeast, also have genetic codes that differ 
slightly from the standard one. The genetic code of mitochondria can dif¬ 
fer from that of the rest of the cell because mitochondrial DNA encodes a 
distinct set of tRNAs. Do any cellular protein-synthesizing systems devi- 
ate from the standard genetic code? Ciliated protozoa differ from most or- 
ganisms in reading UAA and UAG as codons for amino acids rather than 
as stop signals; UGA is their sole termination signal. Thus, the genetic code 
is nearly but not absolutely universal. Variations clearly exist in mitochondria 
and in species, such as ciliates, that branched off very early in eukaryotic 
evolution. It is interesting to note that two of the codon reassignments in 
human mitochondria diminish the information content of the third base of 
the triplet (e.g., both AUA and AUG specify methionine). Most variations 
from the standard genetic code are in the direction of a simpler code. 

Why has the code remained nearly invariant through billions of years of 
evolution, from bactéria to human beings? A mutation that altered the read¬ 
ing of mRNA would change the amino acid sequence of most, if not all, 
proteins synthesized by that particular organism. Many of these changes 
would undoubtedly be deleterious, and so there would be strong selection 
against a mutation with such pervasive consequences. 


5.6 MOST EUKARYOTIC GENES ARE MOSAICS 
OF INTRONS AND EXONS 


In bactéria, polypeptide chains are encoded by a continuous array of triplet 
codons in DNA. For many years, genes in higher organisms also were assumed 
to be continuous. This view was unexpectedly shattered in 1977, when inves- 
tigators in several laboratories discovered that several genes are discontinuous. 
The mosaic nature of eukaryotic genes was revealed by electron microscopic 
studies of hybrids formed between mRNA and a segment of DNA con- 
Intervening sequences taining the corresponding gene (Figure 5.33). For example, the gene for the 

-(introns)-^ jg chain of hemoglobin is interrupted within its amino 

^ ^ I acid-coding sequence by a long intervening sequence of 

240 120 500 550 250 Base pairs 550 base pairs and a short one of 120 base pairs. Thus, 

p-Globin gene the jS-g/obín gene is split into three coding sequences. 


















(A) DNA 



Displaced strand of DNA 



FIGURE 5.33 Detection of ínterveníng 
sequences by electron microscopy. 

An mRNA molecule (shown in red) is 
hybridized to genomic DNA containing the 
corresponding gene. (A) A single loop of 
single-stranded DNA (shown in blue) is 
seen if the gene is continuous. (B) Two 
loops of single-stranded DNA (blue) and 
a loop of double-stranded DNA (blue and 
green) are seen if the gene contains an 
intervening sequence. Additional loops 
are evident if more than one intervening 
sequence is present. 


5.6.1 RNA Processing Generates Mature RNA 

At what stage in gene expression are intervening sequences removed? Newly 
synthesized RNA chains (pre-mRNA) isolated from nuclei are much larger 
than the mRNA molecules derived from them; in the case of j8-globin RNA, 
the former sediment at 15S in zonal centrifugation experiments (Section 
4.1.6) and the latter at 9S. In fact, the primary transcript of the j8-globin 
gene contains two regions that are not present in the mRNA. These inter¬ 
vening sequences in the 15S primary transcript are excised, and the coding se¬ 
quences are simultaneously linked by a precise splicing enzyme toform the ma¬ 
ture 9S mRNA (Figure 5.34). Regions that are removed from the primary 
transcript are called introns (for intervening sequences), whereas those that 
are retained in the mature RNA are called exons (for expressed regions). A 
common feature in the expression of split genes is that their exons are or- 
dered in the same sequence in mRNA as in DNA. Thus, split genes, like 
continuous genes, are colinear with their polypeptide products. 

Splicing is a facile complex operation that is carried out by spliceosomes, 
which are assemblies of proteins and small RNA molecules (Section 28.x.x). 
This enzymatic machinery recognizes signals in the nascent RNA that spec- 
ify the splice sites. Introns nearly always begin with GU and end with an AG 
that is preceded by a pyrimidine-rich tract (Figure 5.35). This consensus se¬ 
quence is part of the signal for splicing. 


5' 3' 

splice site splice site 



Intron 

FIGURE 5.35 Consensus sequence for the splicing of mRNA precursors. 
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FIGURE 5.34 Transcription and 
Processing of the j3-globin gene. 

The gene is transcribed to yield the 
primary transcript, which is modified by 
cap and poly(A) addition. The intervening 
sequences in the primary RNA transcript 
are removed to form the mRNA. 


5.6.2 Many Exons Encode Protein Domains 

Y Most genes of higher eukaryotes, such as birds and mammals, are 
split. Lower eukaryotes, such as yeast, have a much higher propor- 
tion of continuous genes. In prokaryotes, split genes are extremely rare. 
Have introns been inserted into genes in the evolution of higher organisms? 
Or have introns been removed from genes to form the streamlined genomes 
of prokaryotes and simple eukaryotes? Comparisons of the DNA sequences 
of genes encoding proteins that are highly conserved in evolution suggest 
that introns were present in ancestral genes and were lost in the evolution of 
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FIGURE 5.36 Exon shuffling. Exons can be readily 
shuffled by recombination of DNA to expand the genetic 
repertoire. 
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organisms that have become optimized for very rapid growth, 
such as prokaryotes. The positions of introns in some genes are 
at least 1 billion years old. Furthermore, a common mecha- 
nism of splicing developed before the divergence of fungi, 
plants, and vertebrates, as shown by the finding that mam- 
malian cell extracts can splice yeast RN A. Many exons encode 
discrete structural and functional units of proteins. An attrac- 
tive hypothesis is that new proteins arose in evolution by the 
rearrangement of exons encoding discrete structural elements, 
binding sites, and catalytic sites, a process called exon shuffling. 
Because it preserves functional units but allows them to in- 
teract in new ways, exon shuffling is a rapid and efficient 
means of generating novel genes (Figure 5.36). Introns are ex- 
tensive regions in which DNA can break and recombine with 
no deleterious effect on encoded proteins. In contrast, the exchange of se- 
quences between different exons usually leads to loss of function. 

Another advantage conferred by split genes is the potentiality for generat¬ 
ing a series of related proteins by splicing a nascent RNA transcript in differ¬ 
ent ways. For example, a precursor of an antibody-producing cell forms an 
antibody that is anchored in the celbs plasma membrane (Figure 5.37). Stim- 
ulation of such a cell by a specific foreign antigen that is recognized by the 
attached antibody leads to cell differentiation and proliferation. The acti- 
vated antibody-producing cells then splice their nascent RNA transcript in 
an alternative manner to form soluble antibody molecules that are secreted 
rather than retained on the cell surface. We see here a clear-cut example of 
a benefit conferred by the complex arrangement of introns and exons in 
higher organisms. Alternative splicing is a facile means of forming a set of 
proteins that are variations of a basic motif according to a developmental pro- 
gram without requiring a gene for each protein. 
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FIGURE 5.37 Alternative splicing. 

Alternative splicing generates mRNAs that 
are templates for different forms of a 
protein: (A) a membrane-bound antibody 
on the surface of a lymphocyte, and 
(B) its soluble counterpart, exported from 
the cell. The membrane-bound antibody is 
anchored to the plasma membrane by a 
helical segment (highlighted in yellow) 
that is encoded by its own exon. 


SUMMARY 


O A Nucleic Acid Consists of Four Kinds of Bases Linked 
to a Sugar-Phosphate Backbone 

DNA and RNA are linear polymers of a limited number of monomers. 
In DNA, the repeating units are nucleotides with the sugar being a de- 
oxyribose and the bases being adenine (A), thymine (T), guanine (G), 
and cytosine (C). In RNA, the sugar is a ribose and the base uracil (U) 
is used in place of thymine. DNA is the molecule of heredity in all 
prokaryotic and eukaryotic organisms. In viruses, the genetic material 
is either DNA or RNA. 

o A Pair of Nucleic Acid Chains with Complementary Sequences Can Form 
a Double-Helical Structure 

All cellular DNA consists of two very long, helical polynucleotide 
chains coiled around a common axis. The sugar-phosphate backbone of 
each strand is on the outside of the double helix, whereas the purine and 
pyrimidine bases are on the inside. The two chains are held together by 
hydrogen bonds between pairs of bases: adenine is always paired with 
thymine, and guanine is always paired with cytosine. Hence, one strand 
of a double helix is the complement of the other. The two strands of the 
double helix run in opposite directions. Genetic information is encoded 
in the precise sequence of bases along a strand. Most RNA molecules 
are single stranded, but many contain extensive double-helical regions 
that arise from the folding of the chain into hairpins. 
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• DNA Is Replicated by Polymerases That Take Instructíons from Templates 

In the replication of DNA, the two strands of a double helix unwind 
and separate as new chains are synthesized. Each parent strand acts as 
a template for the formation of a new complementary strand. Thus, the 
replication of DNA is semiconservative—each daughter molecule re- 
ceives one strand from the parent DNA molecule. The replication of 
DNA is a complex process carried out by many proteins, including sev- 
eral DNA polymerases. The activated precursors in the synthesis of 
DNA are the four deoxyribonucleoside 5'-triphosphates. The new 
strand is synthesized in the 5'^ 3' direction by a nucleophilic attack 
by the 3'-hydroxyl terminus of the primer strand on the innermost 
phosphorus atom of the incoming deoxyribonucleoside triphosphate. 

Most important, DNA polymerases catalyze the formation of a phos- 
phodiester bond only if the base on the incoming nucleotide is comple¬ 
mentary to the base on the template strand. In other words, DNA poly¬ 
merases are template-directed enzymes. The genes of some viruses, such 
as tobacco mosaic virus, are made of single-stranded RNA. An RNA- 
directed RNA polymerase mediates the replication of this virai RNA. 

Retroviruses, exemplified by HIV-1, have a single-stranded RNA 
genome that is transcribed into double-stranded DNA by reverse tran- 
scriptase, an RNA-directed DNA polymerase. 

• Gene Expression Is the Transformation of DNA Information 
into Functional Molecules 

The flow of genetic Information in normal cells is from DNA to RNA 
to protein. The synthesis of RNA from a DNA template is called tran- 
scription, whereas the synthesis of a protein from an RNA template is 
termed translation. Cells contain several kinds of RNA: messenger RNA 
(mRNA), transfer RNA (tRNA), and ribosomal RNA (rRNA), which 
vary in size from 75 to more than 5000 nucleotides. All cellular RNA is 
synthesized by RNA polymerases according to instructions given by 
DNA templates. The activated intermediates are ribonucleoside triphos- 
phates and the direction of synthesis, like that of DNA, is 5' ^ 3'. RNA 
polymerase differs from DNA polymerase in not requiring a primer. 

• Amino Acids Are Encoded by Groups of Three Bases Starting from a 
Fixed Point 

The genetic code is the relation between the sequence of bases in DNA 
(or its RNA transcript) and the sequence of amino acids in proteins. Amino 
acids are encoded by groups of three bases (called codons) starting from a 
fixed point. Sixty-one of the 64 codons specify particular amino acids, 
whereas the other 3 codons (UAA, UAG, and UGA) are signals for chain 
termination. Thus, for most amino acids, there is more than one code word. 

In other words, the code is degenerate. The genetic code is nearly the same 
in all organisms. Natural mRNAs contain start and stop signals for trans¬ 
lation, just as genes do for directing where transcription begins and ends. 

• Most Eukaryotic Genes Are Mosaics of Introns and Exons 

Most genes in higher eukaryotes are discontinuous. Coding sequences 
(exons) in these split genes are separated by intervening sequences (in¬ 
trons), which are removed in the conversion of the primary transcript 
into mRNA and other functional mature RNA molecules. Split genes, 
like continuous genes, are colinear with their polypeptide products. A 
striking feature of many exons is that they encode functional domains 
in proteins. New proteins probably arose in the course of evolution by 
the shuffling of exons. Introns may have been present in primordial 
genes but were lost in the evolution of such fast-growing organisms as 
bactéria and yeast. 
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deoxyribonucleic acid (DNA) (p. 118) 
deoxyribose (p. 118) 
ribose (p. 118) 

ribonucleic acid (RNA) (p. 118) 

purine (p. 119) 

pyrimidine (p. 119) 

núcleoside (p. 120) 

núcleotide (p. 120) 

replication (p. 121) 

double helix (p. 121) 

semiconservative replication (p. 123) 


DNA polymerase (p. 127) 

template (p. 127) 

primer (p. 127) 

rever se transcriptase (p. 128) 

messenger RNA (mRNA) (p. 129) 

translation (p. 129) 

transfer RNA (tRNA) (p. 129) 

ribosomal RNA (rRNA) (p. 130) 

small nuclear RNA (snRNA) (p. 130) 

transcription (p. 130) 

RNA polymerase (p. 130) 


promoter site (p. 131) 
codon (p. 133) 
genetic code (p. 133) 
ribosome (p. 135) 
intron (p. 137) 
exon (p. 137) 
splicing (p. 137) 
spliceosomes (p. 137) 
exon shuffling (p. 138) 
alternative splicing (p. 138) 
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-1 PROBLEMS I- 

1. Complements. Write the complementary sequence (in the 
standard 5'^ 3' notation) for (a) GATCAA, (b) TCGAAC, 
(c) ACGCGT, and (d) TACCAT. 

2. Compositional constraint. The composition (in mole-fraction 
units) of one of the strands of a double-helical DNA molecule is 
[A] = 0.30 and [G] = 0.24. (a) What can you say about [T] and 
[C] for the same strand? (b) What can you say about [A], [G], 
[T], and [C] of the complementary strand? 

3. Lost DNA. The DNA of a deletion mutant of A bacterio- 
phage has a length of 15 pim instead of 17 pim. How many base 
pairs are missing from this mutant? 

4. An unseen pattern. What result would Meselson and Stahl 
have obtained if the replication of DNA were conservative (i.e., 
the parental double helix stayed together)? Give the expected 
distribution of DNA molecules after 1.0 and 2.0 generations for 
conservative replication. 

5. Tagging DNA. (a) Suppose that you want to radioactively la¬ 
bei DNA but not RNA in dividing and growing bacterial cells. 
Which radioactive molecule would you add to the culture 
médium? (b) Suppose that you want to prepare DNA in which 
the backbone phosphorus atoms are uniformly labeled with 
Which precursors should be added to a solution containing 
DNA polymerase I and primed template DNA? Specify the po- 
sition of radioactive atoms in these precursors. 

6. Finding a template. A solution contains DNA polymerase I 
and the Mg^^ salts of dATP, dGTP, dCTP, and TTP. The 
foliowing DNA molecules are added to aliquots of this solution. 
Which of them would lead to DNA synthesis? (a) A single- 
stranded closed circle containing 1000 nucleotide units. (b) A 
double-stranded closed circle containing 1000 nucleotide pairs. 
(c) A single-stranded closed circle of 1000 nucleotides base- 
paired to a linear strand of 500 nucleotides with a free 3'-OH 
terminus. (d) A double-stranded linear molecule of 1000 nu¬ 
cleotide pairs with a free 3'-OH group at each end. 

7. The right start. Suppose that you want to assay reverse tran- 
scriptase activity. If polyriboadenylate is the template in the as¬ 
say, what should you use as the primer? Which radioactive nu¬ 
cleotide should you use to follow chain elongation? 

8. Essential degradation. Reverse transcriptase has ribonucle- 
ase activity as well as polymerase activity. What is the role of its 
ribonuclease activity? 

9. Vírus hunting. You have purified a virus that infects turnip 
leaves. Treatment of a sample with phenol removes virai pro- 
teins. Application of the residual material to scraped leaves re- 
sults in the formation of progeny virus particles. You infer that 
the infectious substance is a nucleic acid. Propose a simple and 
highly sensitive means of determining whether the infectious nu¬ 
cleic acid is DNA or RNA. 

10. Mutagenic consequences. Spontaneous deamination of cyto- 
sine bases in DNA occurs at low but measurable frequency. Cy- 
tosine is converted into uracil by loss of its amino group. After 
this conversion, which base pair occupies this position in each of 
the daughter strands resulting from one round of replication? 
Two rounds of replication? 


11. Information content. (a) How many different 8-mer se- 
quences of DNA are there? (Hint: There are 16 possible dinu- 
cleotides and 64 possible trinucleotides.) (b) How many bits of 
information are stored in an 8-mer DNA sequence? In the E. coli 
genome? In the human genome? (c) Compare each of these val- 
ues with the amount of information that can be stored on a per- 
sonal Computer diskette. A byte is equal to 8 bits. 

12. Key polymerases. Compare DNA polymerase I and RNA 
polymerase from E. coli in regard to each of the folio wing fea- 
tures: (a) activated precursors, (b) direction of chain elongation, 
(c) conservation of the template, and (d) need for a primer. 

13. Encoded sequences. (a) Write the sequence of the mRNA mol¬ 
ecule synthesized from a DNA template strand having the sequence 

5' - ATCGTACCGTTA- 3' 

(b) What amino acid sequence is encoded by the following base 
sequence of an mRNA molecule? Assume that the reading frame 
starts at the 5' end. 

5' -UUGCCUAGUGAUUGGAUG- 3' 

(c) What is the sequence of the polypeptide formed on addition 
of poly(UUAC) to a cell-free protein-synthesizing system? 

14. A tougher chain. RNA is readily hydrolyzed by alkali, 
whereas DNA is not. Why? 

15. A potent blocker. How does cordycepin (3'-deoxyadenosine) 
block the synthesis of RNA? 

16. Silent RNA. The code word GGG cannot be deciphered in 
the same way as can UUU, CCC, and AAA, because poly(G) 
does not act as a template. Poly(G) forms a triple-stranded heli- 
cal structure. Why is it an ineffective template? 

17. Two from one. Synthetic RNA molecules of defined sequence 
were instrumental in deciphering the genetic code. Their syn¬ 
thesis first required the synthesis of DNA molecules to serve 
as a template. H. Gobind Khorana synthesized, by organic- 
chemical methods, two complementary deoxyribonucleotides, 
each with nine residues: d(TAC )3 and d(GTA) 3 . Partly over- 
lapping duplexes that formed on mixing these oligonucleotides 
then served as templates for the synthesis by DNA polymerase 
of long, repeating double-helical DNA chains. The next step was 
to obtain long polyribonucleotide chains with a sequence com¬ 
plementary to only one of the two DNA strands. How did he 
obtain only poly(UAC)? Only poly(GUA)? 

18. Overlapping or not. In a nonoverlapping triplet code, each 
group of three bases in a sequence ABCDEF ... specifies only 
one amino acid—ABC specifies the first, DEF the second, and 
so forth—whereas, in a completely overlapping triplet code, 
ABC specifies the first amino acid, BGD the second, CDE the 
third, and so forth. Assume that you can mutate an individual 
nucleotide of a codon and detect the mutation in the amino acid 
sequence. Design an experiment that would establish whether 
the genetic code is overlapping or nonoverlapping. 

19. Triple entendre. The RNA transcript of a region of T4 phage 
DNA contains the sequence 5'-AAAUGAGGA-3'. This se¬ 
quence encodes three different polypeptides. What are they? 
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20. Valuable synonyms. Proteins generally have low contents of 
Met and Trp, intermediate ones of His and Cys, and high ones 
of Leu and Ser. What is the relation between the number of 
codons of an amino acid and its frequency of occurrence in pro¬ 
teins? What might be the selective advantage of this relation? 

21. A new translation. A transfer RNA with a UGU anticodon 
is enzymatically conjugated to ^"'^C-labeled cysteine. The cys- 
teine unit is then chemically modified to alanine (with the use 
of Raney nickel, which removes the sulfur atom of cysteine). The 
altered aminoacyl-tRNA is added to a protein-synthesizing Sys¬ 
tem containing normal components except for this tRNA. The 
mRNA added to this mixture contains the following sequence: 

5'-UUUUGCCAUGUUUGUGCU-3' 

What is the sequence of the corresponding radiolabeled peptide? 

Chapter Integration Problems 

22. Eons ago. The atmosphere of the primitive Earth before the 
emergence of life contained N 2 , NH 3 , H 2 , HCN, CO, and H 2 O. 
Which of these compounds is a likely precursor of most of the 
atoms in adenine? Why? 

23. Back to the bench. A protein chemist told a molecular ge- 
neticist that he had found a new mutant hemoglobin in which 


aspartate replaced lysine. The molecular geneticist expressed 
surprise and sent his friend scurrying back to the laboratory. (a) 
Why did the molecular geneticist doubt the reported amino acid 
substitution? (b) Which amino acid substitutions would have 
been more palatable to the molecular geneticist? 

24. Eons apart. The amino acid sequences of a yeast protein and 
a human protein carrying out the same function are found to be 
60% identical. However, the corresponding DNA sequences are 
only 45% identical. Account for this differing degree of identity. 

Media Problem 

25. More than one way to pair a base. Genetic mutations can 
arise due to nonstandard base pairing during DNA replication. 
Such mispairing can be made more likely by the chemical mod- 
ification of bases (which is how mutagens work). One example 
is oxidation of guanine to 8-oxoguanine. An effect of this mod- 
ification is to introduce some steric strain into the anti configu- 
ration of the glycosylic bond, making the syn configuration more 
favorable than usual. Look at the Media Problem section of the 
Structural Insights module on nucleic acids and explain why 
8-oxoguanine often mispairs with adenine. 
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Processes such as development from a Caterpillar into a butterfly involve 
dramatic changes in patterns of gene expression. The expression leveis of 
thousands of genes can be monitored through the use of DNA arrays. At right, a 
GeneChip® reveals the expression leveis of more than 12,000 human genes; the 
brightness of each spot indicates the expression levei of the corresponding gene. 
[(Left) Roger Hart/Rainbow. (Right) GeneChip courtesy of Affynnetrix.] 


OUTLINE 


Recombinant DNA technology has revolutionized biochemistry since it 
came into being in the 1970s. The genetic endowment of organisms can now 
be precisely changed in designed ways. Recombinant DNA technology is a 
fruit of several decades of basic research on DNA, RNA, and viruses. It de- 
pends, first, on having enzymes that can cut, join, and replicate DNA and 
rever se transcribe RNA. Restriction enzymes cut very long DNA molecules 
into specific fragments that can be manipulated; DNA ligases join the frag- 
ments together. The availability of many kinds of restriction enzymes and 
DNA ligases makes it feasible to treat DNA sequences as modules that can 
be moved at will from one DNA molecule to another. Thus, recombinant 
DNA technology is based on nucleic acid enzymology. 

A second foundation is the base-pairing language that 
allows complementary sequences to recognize and bind to 
each other. Hybridization with complementary DNA or 
RNA probes is a sensitive and powerful means of detecting 
specific nucleotide sequences. In recombinant DNA tech¬ 
nology, base-pairing is used to construct new combinations 
of DNA as well as to detect and amplify particular se¬ 
quences. This revolutionary technology is also critically de- 
pendent on our understanding of viruses, the ultimate par¬ 
asites. Viruses efficiently deliver their own DNA (or RNA) 
into hosts, subverting them either to replicate the virai 
genome and produce virai proteins or to incorporate virai 
DNA into the host genome. Likewise, plasmids, which are 
accessory chromosomes found in bactéria, have been indis- 
pensable in recombinant DNA technology. 


6.1 The Basic Tools of Gene Exploration 

6.2 Recombinant DNA Technology Has 
Revolutionized All Aspects of Biology 

6.3 Manipulating the Genes 
of Eukaryotes 

6.4 Novel Proteins Can Be Engineered 
by Site-Specific Mutagenesis 
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Palindrome— 

A Word, sentence, or verse that reads 
the same from right to left as it does 
from left to right. 

Radar 

Madam, Tm Adam 

Abie was I ere I saw Eiba 

Roma tibi súbito motibus ibit amor 

Derived from the Greek palindromos, 
"running back again." 


These new methods have wide-ranging benefits. Entire genomes, in- 
cluding the human genome, are being deciphered. New insights are emerg- 
ing, for example, into the regulation of gene expression in câncer and devel- 
opment and the evolutionary history of proteins as well as organisms. New 
proteins can be created by altering genes in specific ways to provide detailed 
views into protein function. Clinically useful proteins, such as hormones, are 
now synthesized by recombinant DNA techniques. Crops are being gener- 
ated to resist pests and harsh conditions. The new opportunities opened by 
recombinant DNA technology promise to have broad effects. 


6.1 THE BASIC TOOLS OF GENE EXPLORATION 

The rapid progress in biotechnology—indeed its very existence—is a result 
of a relatively few techniques. 

1. Restriction-enzyme analysis. Restriction enzymes are precise, molecular 
scalpels that allow the investigator to manipulate DNA segments. 

2. Blotting techniques. The Southern and Northern blots are used to sepa- 
rate and characterize DNA and RNA, respectively. The Western blot, which 
uses antibodies to characterize proteins, was described in Section 4.3.4. 

3. DNA sequencing. The precise nucleotide sequence of a molecule of DNA 
can be determined. Sequencing has yielded a wealth of information concern- 
ing gene architecture, the control of gene expression, and protein structure. 

4. Solid-phase synthesis of nucleic acids. Precise sequences of nucleic acids can 
be synthesized de novo and used to identify or amplify other nucleic acids. 

5. The polymerase chain reaction (PCR). The polymerase chain reaction 
leads to a billionfold amplification of a segment of DNA. One molecule of 
DNA can be amplified to quantities that permit characterization and ma- 
nipulation. This powerful technique is being used to detect pathogens and 
genetic diseases, to determine the source of a hair left at the scene of a crime, 
and to resurrect genes from fossils. 

A final tool, the use of which will be highlighted in the next chapter, is 
the Computer. Without the Computer, it would be impossible to catalog, ac- 
cess, and characterize the abundant information, especially DNA sequence 
information, that the techniques just outlined are rapidly generating. 

6.1.1 Restriction Enzymes Split DNA into Specific Fragments 

Restriction enzymes, also called restriction endonucleases, recognize specific 
base sequences in double-helical DNA and cleave, at specific places, both 
strands of a duplex containing the recognized sequences. To biochemists, 
these exquisitely precise scalpels are marvelous gifts of nature. They are in- 
dispensable for analyzing chromosome structure, sequencing very long 
DNA molecules, isolating genes, and creating new DNA molecules that can 
be cloned. Werner Arber and Hamilton Smith disco ver ed restriction en¬ 
zymes, and Daniel Nathans pioneered their use in the late 1960s. 

Restriction enzymes are found in a wide variety of prokaryotes. Their 
biological role is to cleave foreign DNA molecules. The celfs own DNA is 
not degraded, because the sites recognized by its own restriction enzymes 
are methylated. Many restriction enzymes recognize specific sequences of 
four to eight base pairs and hydrolyze a phosphodiester bond in each strand 
in this region. A striking characteristic of these cleavage sites is that they al- 
most always possess twofold rotational symmetry. In other words, the recog¬ 
nized sequence is palindromic, or an inverted repeat, and the cleavage sites 







are symmetrically positioned. For example, the sequence recognized by a 
restriction enzyme from Streptomyces achromogenes is: 

Cleavage site 

i 

5' C —C —G —C —G—G 3' 

i i i • i 1 i 

3' G-G-Cf G-C-C 5' 

í r 

cleavage site ^ Symmetry axis 

In each strand, the enzyme cleaves the C-G phosphodiester bond on the 3' 
side of the symmetry axis. As we shall see in Chapter 9, this symmetry re- 
flects that of structures of the restriction enzymes themselves. 

More than 100 restriction enzymes have been purified and character- 
ized. Their names consist of a three-letter abbreviation for the host organ- 
ism (e.g., Eco for Escherichia coli, Hin for Haemophilus influenzae, Hae for 
Haemophilus aegyptius) followed by a strain designation (if needed) and a 
roman numeral (if more than one restriction enzyme from the same strain 
has been identified). The specificities of several of these enzymes are shown 
in Figure 6.1. Note that the cuts may be staggered or even. 

Restriction enzymes are used to cleave DNA molecules into specific 
fragments that are more readily analyzed and manipulated than the entire 
parent molecule. For example, the 5.1-kb circular duplex DNA of the tu- 
mor-producing SV40 virus is cleaved at 1 site by EcoKl, 4 sites by Hpal, 
and 11 sites by fímdlll. A piece of DNA produced by the action of one re¬ 
striction enzyme can be specifically cleaved into smaller fragments by an- 
other restriction enzyme. The pattern of such fragments can serve as a^n- 
gerprint of a DNA molecule, as will be discussed shortly. Indeed, complex 
chromosomes containing hundreds of millions of base pairs can be mapped 
by using a series of restriction enzymes. 

6.1.2 Restriction Fragments Can Be Separated 
by Gel Electrophoresis and Visualized 

Small differences between related DNA molecules can be readily detected 
because their restriction fragments can be separated and displayed by gel 
electrophoresis. In many types of gels, the electrophoretic mobility of a 
DNA fragment is inversely proportional to the logarithm of the number of 
base pairs, up to a certain limit. Polyacrylamide gels are used to separate 
fragments containing about as many as 1000 base pairs, whereas more 
porous agarose gels are used to resolve mixtures of larger fragments (about 
as many as 20 kb). An important feature of these gels is their high resolv- 
ing power. In certain kinds of gels, fragments differing in length by just one 
nucleotide of several hundred can be distinguished. Moreover, entire chromo¬ 
somes containing millions of nucleotides can be separated on agarose gels 
by applying pulsed electric fields (pulsed-field gel electrophoresis, PFGE) 
in different directions. This technique depends on the differential stretch- 
ing and relaxing of large DNA molecules as an electric field is turned off 
and on at short intervals. Bands or spots of radioactive DNA in gels can be 
visualized by autoradiography (Section 4.1.4). Alternatively, a gel can be 
stained with ethidium bromide, which fluoresces an intense orange when 
bound to double-helical DNA molecule (Figure 6.2). A band containing 
only 50 ng of DNA can be readily seen. 


FIGURE 6.2 Gel electrophoresis pattern of a restriction digest. This gel shows the 
fragments produced by cleaving SV40 DNA with each of three restriction enzymes. These 
fragments were made fluorescent by staining the gel with ethidium bromide. [Courtesy of 
Dr. Jeffrey Skiar.] 
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5' GGATCC 3' 
3' CCTAGG 5' 
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5' GAATTC 3' 
3' CTTAAG 5' 
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5' GGCC 3' 

3' CCGG 5' 
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5' GCGC 3' 

3' CGCG 5' 
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5' CTCGAG 3' 
3' GAGCTC 5' 
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BamH\ 


Eco RI 


Hae\\\ 


Hha\ 


Xho\ 


FIGURE 6.1 Specificities of some 
restriction endonucleases. The base- 
pair sequences that are recognized by 
these enzymes contain a twofold axis of 
symmetry. The two strands in these regions 
are related by a 180-degree rotation about 
the axis marked by the green Symbol. The 
cleavage sites are denoted by red arrows. 
The abbreviated name of each restriction 
enzyme is given at the right of the 
sequence that it recognizes. 
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FIGURE 6.3 Southern blotting. A DNA 

fragment containing a specific sequence 
can be identified by separating a mixture 
of fragments by electrophoresis, 
transferring them to nitrocellulose, and 
hybridizing with a ^^P-labeled probe 
complementary to the sequence. The 
fragment containing the sequence is then 
visualized by autoradiography. 


DNA 
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gel 


Transfer 
of DNA 
by blotting 
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^^P-labeled 
DNA probe 

-> 
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Autoradiography 
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DNA 
^ probe 
revealed 


Autoradiogram 


I- 

Restriction-fragment-length 

polymorphism (RFLP)— 

Southern blotting can be used to 
follow the inheritance of selected 
genes. Mutations within restriction sites 
change the sizes of restriction 
fragments and hence the positions of 
bands in Southern-blot analyses. The 
existence of genetic diversity in a 
population is termed polymorphism. 

The detected mutation may itself cause 
disease or it may be closely linked to 
one that does. Genetic diseases such 
as sickie-cell anemia, cystic fibrosis, 
and Huntington chorea can be 
detected by RFLP analyses. 


A restriction fragment containing a specific base sequence can be iden¬ 
tified by hybridizing it with a labeled complementary DNA strand (Figure 
6.3). A mixture of restriction fragments is separated by electrophoresis 
through an agarose gel, denatured to form single-stranded DNA, and trans- 
ferred to a nitrocellulose sheet. The positions of the DNA fragments in the 
gel are preserved on the nitrocellulose sheet, where they are exposed to a 
^^P-labeled single-stranded DNA probe, The probe hybridizes with a re¬ 
striction fragment having a complementary sequence, and autoradiography 
then reveals the position of the restriction-fragment-probe duplex. A par¬ 
ticular fragment in the midst of a million others can be readily identified in 
this way, like finding a needle in a haystack. This powerful technique is 
known as Southern blotting because it was devised by Edwin Southern. 

Similar ly, RN A molecules can be separated by gel electrophoresis, and 
specific sequences can be identified by hybridization subsequent to their 
transfer to nitrocellulose. This analogous technique for the analysis of RNA 
has been whimsically termed Northern blotting. A further play on words ac- 
counts for the term Western blotting, which refers to a technique for detect- 
ing a particular protein by staining with specific antibody (Section 4.3.4). 
Southern, Northern, and Western blots are also known respectively as DNA, 
RNA, and protein blots. 

6.1.3 DNA Is Usually Sequenced by Controlled Termination 
of Replication (Sanger Dideoxy Method) 

The analysis of DNA structure and its role in gene expression also have 
been markedly facilitated by the development of powerful techniques for 
the sequencing of DNA molecules. The key to DNA sequencing is the gen- 
eration of DNA fragments whose length depends on the last base in the se¬ 
quence. Collections of such fragments can be generated through the con¬ 
trolled interruption of enzymatic replication, a method developed by 
Frederick Sanger and coworkers. This technique has superseded alternative 
methods because of its simplicity. The same procedure is performed on four 
reaction mixtures at the same time. In all these mixtures, a DNA polymerase 
is used to make the complement of a particular sequence within a single- 
stranded DNA molecule. The synthesis is primed by a fragment, usually 
obtained by chemical synthetic methods described in Section 6.1.4, that is 
complementary to a part of the sequence known from other studies. In ad- 
dition to the four deoxyribonucleoside triphosphates (radioactively labeled), 
each reaction mixture contains a small amount of the 2',3'-dideoxy analog 
of one of the nucleotides, a different nucleotide for each reaction mixture. 





















































DNA to be sequenced 



FIGURE 6.4 Strategy of the chaín-termínation 
method for sequencing DNA. Fragments are 
produced by adding the 2',3'-dideoxy analog of a 
dNTP to each of four polymerization mixtures. For 
example, the addition of the dideoxy analog of 
dATP (shown in red) resuits in fragments ending in 
A. The dideoxy analog cannot be extended. 


3'-GAATTCGCTAATGC- 

5—CTTAA 


Primer 


DNA polymerase I 
Labeled dATP, dTTP, 
dCTP, dGTP 

Dideoxy analog of dATP 


3'-GAATTCGCTAATGC- 

5— CTTAAGCGATTA 
+ 

3'-GAATTCGCTAATGC- 

5— CTTAAGCGA 


New DNA strands are separated 
and electrophoresed 


Basic Tools 


The incorporation of this analog blocks further growth of the new chain be- 
cause it lacks the 3'-hydroxyl terminus needed to form the next phospho- 
diester bond. The concentration of the dideoxy analog is low enough that 
chain termination will take place only occasionally. The polymerase will 
sometimes insert the correct nucleotide and other times the dideoxy analog, 
stopping the reaction. For instance, if the dideoxy analog of dATP is pres- 
ent, fragments of various lengths are produced, but all will be terminated 
by the dideoxy analog (Figure 6.4). Importantly, this dideoxy analog of 
dATP will be inserted only where a T was located in the DNA being se¬ 
quenced. Thus, the fragments of different length will correspond to the po- 
sitions of T. Four such sets of chain-terminated fragments 
(one for each dideoxy analog) then undergo electrophore- 
sis, and the base sequence of the new DNA is read from the 
autoradiogram of the four lanes. 

Fluorescence detection is a highly effective alternative 
to autoradiography. A fluorescent tag is attached to an 
oligonucleotide priming fragment—a differently colored 
one in each of the four chain-terminating reaction mixtures 
(e.g., a blue emitter for termination at A and a red one for 
termination at C). The reaction mixtures are combined and 
subjected to electrophoresis together. The separated bands 
of DNA are then detected by their fluorescence as they 
emerge from the gel; the sequence of their colors directly 
gives the base sequence (Figure 6.5). Sequences of as many 
as 500 bases can be determined in this way. Alternatively, 
the dideoxy analogs can be labeled, each with a specific flu¬ 
orescent labei. When this method is used, all four termina- 
tors can be placed in a single tube, and only one reaction is 
necessary. Fluorescence detection is attractive because it 
eliminates the use of radioactive reagents and can be read- 
ily automated. 

Sanger and coworkers determined the complete sequence of the 5386 bases 
in the DNA of the (/)X174 DNA virus in 1977, just a quarter century after 
Sanger s pioneering elucidation of the amino acid sequence of a protein. This 
accomplishment is a landmark in molecular biology because it revealed the 
total Information content of a DNA genome. This tour de force was followed 
several years later by the determination of the sequence of human mitochon- 
drial DNA, a double-stranded circular DNA molecule containing 16,569 base 
pairs. It encodes 2 ribosomal RNAs, 22 transfer RNAs, and 13 proteins. In 



Oligonucleotide length 

FIGURE 6.5 Fluorescence detection of 
oligonucleotide fragments produced 
by the dideoxy method. Each of the 
four chain-terminating mixtures is primed 
with a tag that fluoresces at a different 
wavelength (e.g., blue for A). The 
sequence determined by fluorescence 
measurements at four wavelengths is 
shown at the bottom. [From L. M. Smith, 

J. Z. Sanders, R. J. Kaiser, P. Hughes, C. Dodd, 

C. R. Connell, C. Heiner, S. B. H. Kent, and L. E. 
Hood. Nature 321 (1986):674.] 





















recent years, the complete genomes of free-living organ- 
isms have been sequenced. The first such sequence to be 
completed was that of the bacterium Haemophilus in- 
fluenzae. Its genome comprises 1,830,137 base pairs and 
encodes approximately 1740 proteins (Figure 6.6). 

Many other bacterial and archaeal genomes have 
since been sequenced. The first eukaryotic genome to be 
completely sequenced was that of baker s yeast, Sacchar- 
omyces cerevisiae, which comprises approximately 12 
million base pairs, distributed on 16 chromosomes, and 
encodes more than 6000 proteins. This achievement was 
followed by the first complete sequencing of the genome 
of a multicellular organism, the nematode Caenorhabditis elegans, which 
contains nearly 100 million base pairs. The human genome is considerably 
larger at more than 3 billion base pairs, but it has been essentially com¬ 
pletely sequenced. The ability to determine complete genome sequences has 
revolutionized biochemistry and biology. 


FIGURE 6.6 A complete genome. The diagram depicts the 
genome of Haemophilus infíuenzae, the first complete genome of a 
free-living organism to be sequenced. The genome encodes more 
than 1700 proteins and 70 RNA molecules. The likely function of 
approximately one-half of the proteins was determined by 
comparisons with sequences from proteins previousiy characterized 
in other species. [From R. D. Fleischmann et al., Science 269(1995):496; 
scan courtesy of TI GR.] 


FIGURE 6.7 Solíd-phase synthesís of a 
DNA Chain by the phosphíte triester 
method. The activated monomer added 
to the growing chain is a deoxyribo- 
nucleoside 3'-phosphoramidite containing 
a DMT protecting group on its 5' oxygen 
atom, a jS-cyanoethyl (jSCE) protecting 
group on its 3' phosphoryl oxygen, and a 
protecting group on the base. 


6.1.4 DNA Probes and Genes Can Be Synthesized 
by Automated Solid-Phase Methods 

DNA strands, like polypeptides (Section 4.4), can be synthesized by the se- 
quential addition of activated monomers to a growing chain that is linked 
to an insoluble support. The activated monomers are protonated deoxyri- 
bonucleoside 3' -phosphoramidites. In step 1, the 3' phosphorus atom of this 
incoming unit becomes joined to the 5' oxygen atom of the growing chain 
to form diphosphite triester (Figure 6.7). The 5'-OH group of the activated 
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monomer is unreactive because it is blocked by a dimethoxytrityl (DMT) 
protecting group, and the 3'-phosphoryl group is rendered unreactive by 
attachment of the j 8 -cyanoethyl (jSCE) group. Likewise, amino groups on 
the purine and pyrimidine bases are blocked. 

Coupling is carried out under anhydrous conditions 
because water reacts with phosphoramidites. In step 2, the 
phosphite triester (in which P is trivalent) is oxidized by 
iodine to form diphosphotriester (in which P is pentavalent). 

In step 3, the DMT protecting group on the 5' - OH of the 
growing chain is removed by the addition of dichloro- 
acetic acid, which leaves other protecting groups intact. 

The DNA chain is now elongated by one unit and ready 
for another cycle of addition. Each cycle takes only about 
10 minutes and elongates more than 98% of the chains. 

This solid-phase approach is ideal for the synthesis of 
DNA, as it is for polypeptides, because the desired prod- 
uct stays on the insoluble support until the final release 
step. All the reactions take place in a single vessel, and 
excess soluble reagents can be added to drive reactions to 
completion. At the end of each step, soluble reagents and 
by-Products are washed away from the glass beads that 
bear the growing chains. At the end of the synthesis, NH 3 
is added to remove all protecting groups and release the oligonucleotide from 
the solid support. Because elongation is never 100 % complete, the new DNA 
chains are of diverse lengths—the desired chain is the longest one. The sam- 
ple can be purified by high-pressure liquid chromatography or by elec- 
trophoresis on polyacrylamide gels. DNA chains of as many as 100 nu- 
cleotides can be readily synthesized by this automated method. 

The ability to rapidly synthesize DNA chains of any selected sequence 
opens many experimental avenues. For example, synthesized oligonu¬ 
cleotide labeled at one end with ^^P or a fluorescent tag can be used to search 
for a complementary sequence in a very long DNA molecule or even in a 
genome consisting of many chromosomes. The use of labeled oligonu- 
cleotides as DNA probes is powerful and general. For example, a DNA probe 
that can base-pair to a known complementary sequence in a chromosome 
can serve as the starting point of an exploration of adjacent uncharted DNA. 
Such a probe can be used as a primer to initiate the replication of neighbor- 
ing DNA by DNA polymerase. One of the most exciting applications of 
the solid-phase approach is the synthesis of new tailor-made genes. New pro- 
teins with novel properties can now be produced in abundance by express- 
ing synthetic genes. Protein engineering has become a reality. 

6.1.5 Selected DNA Sequences Can Be Greatly Amplified 
by the Polymerase Chain Reaction 
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In 1984, Kary Mullis devised an ingenious method called the polymerase 
chain reaction (PCR) for amplifying specific DNA sequences. Consider a 
DNA duplex consisting of a target sequence surrounded by nontarget 
DNA. Millions of the target sequences can be readily obtained by PCR if 
the flanking sequences of the target are known. PCR is carried out by adding 
the following components to a solution containing the target sequence: ( 1 ) 
a pair of primers that hybridize with the flanking sequences of the target, 
( 2 ) all four deoxyribonucleoside triphosphates (dNTPs), and (3) a heat-sta- 
ble DNA polymerase. A PCR cycle consists of three steps (Figure 6 . 8 ). 

1 . Strand separation. The two strands of the parent DNA molecule are sep- 
arated by heating the solution to 95°C for 15 s. 
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FIGURE 6.8 The first cycle in the 
polymerase chain reaction (PCR). A 

cycle consists of three steps: strand 
separation, hybridization of primers, and 
extension of primers by DNA synthesis. 
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FIGURE 6.9 Multiple cycles of the polymerase chaín reactíon. 

The two short strands produced at the end of the third cycle (along 
with longer stands not shown) represent the target sequence. 
Subsequent cycles will amplify the target sequence exponentially and 
the parent sequence arithmetically. 

2. Hybridization of primers. The solution is then abruptly 
cooled to 54 °C to allow each primer to hybridize to a 
DNA strand. One primer hybridizes to the 3'-end of the 
target on one strand, and the other primer hybridizes to 
the 3' end on the complementary target strand. Parent 
DNA duplexes do not form, because the primers are 
present in large excess. Primers are typically from 20 to 
30 nucleotides long. 

3. DNA synthesis. The solution is then heated to 72°C, 
the optimal temperature for Taq DNA polymerase. This 
heat-stable polymerase comes from Thermus aquaticus, a 
thermophilic bacterium that lives in hot springs. The 
polymerase elongates both primers in the direction of the 
target sequence because DNA synthesis is in the 5'-to- 
3' direction. DNA synthesis takes place on both strands 
but extends beyond the target sequence. 

These three steps —strand separation, hybridization 
of primers, and DNA synthesis—constitute one cycle of 
the PCR amplification and can be carried out repetitively 
just by changing the temperature of the reaction mixture. 
The thermostability of the polymerase makes it feasible 
to carry out PCR in a closed Container; no reagents are 
added after the first cycle. The duplexes are heated to be- 
gin the second cycle, which produces four duplexes, and 
then the third cycle is initiated (Figure 6.9). At the end 
of the third cycle, two short strands appear that consti¬ 
tute only the target sequence—the sequence including 
and bounded by the primers. Subsequent cycles will am¬ 
plify the target sequence exponentially. The larger 
strands increase in number arithmetically and serve as a 
source for the synthesis of more short strands. Ideally, af¬ 
ter n cycles, this sequence is amplified 2'^-fold. The am¬ 
plification is a millionfold after 20 cycles and a billion- 
fold after 30 cycles, which can be carried out in less than 
an hour. 

Several features of this remarkable method for ampli- 
fying DNA are noteworthy. First, the sequence of the tar¬ 
get need not be known. All that is required is knowledge 
of the flanking sequences. Second, the target can be much 
larger than the primers. Targets larger than 10 kb have 
been amplified by PCR. Third, primers do not have to 

- 1 be perfectly matched to flanking sequences to amplify 

— _ targets. With the use of primers derived from a gene of 

- 1 known sequence, it is possible to search for variations on 

— ^ the theme. In this way, families of genes are being dis- 

I covered by PCR. Fourth, PCR is highly specific because 

of the stringency of hybridization at high temperature 
(54 °C). Stringency is the required closeness of the match 
between primer and target, which can be controlled by temperature and salt. 
At high temperatures, the only DNA that is amplified is that situated be- 
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tween primers that have hybridized. A gene constituting less than a millionth 
of the total DNA of a higher organism is accessible by PCR. Fifth, PCR is 
exquisitely sensitive. A single DNA molecule can be amplified and detected. 

6.1.6 PCR Is a PowerfuI Technique in Medicai Diagnostics, 
Forensics, and Molecular Evolution 

PCR can provide valuable diagnostic Information in medicine. Bactéria and 
viruses can be readily detected with the use of specific primers. For example, 
PCR can reveal the presence of human immunodeficiency virus in people who 
have not mounted an immune response to this pathogen and would therefore 
be missed with an antibody assay. Finding Mycobacterium tuberculosis bacilli 
in tissue specimens is slow and laborious. With PCR, as few as 10 tubercle 
bacilli per million human cells can be readily detected. PCR is a promising 
method for the early detection of certain câncers. This technique can identify 
mutations of certain growth-control genes, such as the ras genes (Section 1 S.x.x). 
The capacity to greatly amplify selected regions of DNA can also be highly in- 
formative in monitoring câncer chemotherapy. Tests using PCR can detect 
when cancerous cells have been eliminated and treatment can be stopped; they 
can also detect a relapse and the need to immediately resume treatment. PCR 
is ideal for detecting leukemias caused by chromosomal rearrangements. 

PCR is also having an effect in forensics and legal medicine. An indi¬ 
vidual DNA profile is highly distinctive because many genetic loci are 
highly variable within a population. For example, variations at a specific 
one of these locations determines a person^s HLA type (human leukocyte 
antigen type); organ transplants are rejected when the HLA types of the 
donor and recipient are not sufficiently matched. PCR amplification of mul- 
tiple genes is being used to establish biological parentage in disputed pa- 
ternity and immigration cases. Analyses of blood stains and semen samples 
by PCR have implicated guilt or innocence in numerous assault and rape 
cases. The root of a single shed hair found at a crime scene contains enough 
DNA for typing by PCR (Figure 6.10). 

DNA is a remarkably stable molecule, particularly when relatively 
shielded from air, light, and water. Under such circumstances, large frag- 
ments of DNA can remain intact for thousands of years or longer. PCR pro- 
vides an ideal method for amplifying such ancient DNA molecules so that 
they can be detected and characterized (Section 7.5.1). PCR can also be used 
to amplify DNA from microorganisms that have not yet been isolated and 
cultured. As will be discussed in the next chapter, sequences from these 
PCR Products can be sources of considerable insight into evolutionary re- 
lationships between organisms. 


6.2 RECOMBINANT DNA TECHNOLOGY HAS 
REVOLUTIONIZED ALL ASPECTS OF BIOLOGY 


The pioneering work of Paul Berg, Herbert Boyer, and Stanley Cohen in 
the early 1970s led to the development of recombinant DNA technology, 
which has permitted biology to move from an exclusively analytical Science 
to a synthetic one. New combinations of unrelated genes can be constructed 
in the laboratory by applying recombinant DNA techniques. These novel 
combinations can be cloned—amplified manyfold—by introducing them 
into suitable cells, where they are replicated by the DNA-synthesizing ma- 
chinery of the host. The inserted genes are often transcribed and translated 
in their new setting. What is most striking is that the genetic endowment 
of the host can be permanently altered in a designed way. 


I 151 I 

Recombinant DNA Technology 


\ 1 kb TS 


4^.8 8^.8 

jeans ^shirt^ 


\ Ikb 




FIGURE 6.10 DNA and forensics. DNA 

analysis can be used to establish guilt in 
criminal cases. Here, DNA was isolated 
from bloodstains on the pants and shirt of 
a defendant and amplified by PCR. The 
DNA was then compared to the DNA from 
the victim as well as the defendant using 
gel electrophoresis and autoradiography. 
DNA from the bloodstains on the 
defendant's clothing matched the pattern 
of the victim, but not that of the 
defendant. The frequency of a coincidental 
match of the DNA pattern on the clothing 
and the victim is approximately 1 in 33 
billion. Lanes A, 1 kb, and TS = Control 
DNA samples; lane D = DNA from the 
defendant; jeans = DNA isolated from 
bloodstains on defendenfs pants; shirt = 
DNA isolated from bloodstains of the 
defendanfs shirt (two different amounts 
analyzed); V = DNA sample from victim's 
blood. [Courtesy of Cellmark Diagnostics, 
Gernnantown MD.] 
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Let us begin by seeing how novel DNA molecules can be constructed in the 
laboratory. A DNA fragment of interest is covalently joined to a DNA vec- 
tor. The essential feature of a vector is that it can replicate autonomously in 
an appropriate host. Plasmids (naturally occurring circles of DNA that act 
as accessory chromosomes in bactéria) and bacteriophage A, a virus, are 
choice vectors for cloning in E. coli. The vector can be prepared for accept- 
ing a new DNA fragment by cleaving it at a single specific site with a re¬ 
striction enzyme. For example, the plasmid pSClOl, a 9.9-kb double- 
helical circular DNA molecule, is split at a unique site by the EcoKl 
restriction enzyme. The staggered cuts made by this enzyme produce comp- 
lementary single-stranded ends, which have specific affinity for each other 
and hence are known as cohesive or sticky ends. Any DNA fragment can be 
inserted into this plasmid if it has the same cohesive ends. Such a fragment 
can be prepared from a larger piece of DNA by using the same restriction 
enzyme as was used to open the plasmid DNA (Figure 6.11). 


FIGURE 6.11 Joíníng of DNA molecules 
by the cohesíve-end method. Two 

DNA molecules, cleaved with a common 
restriction enzyme such as EcoR\, can be 
ligated to form recombinant molecules. 
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FIGURE 6.12 Formatíon of cohesive ends. Cohesive ends 
are formed by the addition and cleavage of a chemically 
synthesized linker. 


The single-stranded ends of the fragment are then complementary to those 
of the cut plasmid. The DNA fragment and the cut plasmid can be annealed 
and then joined by DNA ligase, which catalyzes the formation of a phos- 
phodiester bond at a break in a DNA chain. DNA ligase requires a free 3'- 
hydroxyl group and a 5'-phosphoryl group. Furthermore, the chains joined 
by ligase must be in a double helix. An energy source such as ATP or NAD^ 
is required for the joining reaction, as will be discussed in Chapter 27. 

^ This cohesive-end method for joining DNA mole- 

' cules can be made general by using a short, chemically 

synthesized DNA linker that can be cleaved by restriction 
enzymes. First, the linker is covalently joined to the ends 
of a DNA fragment or vector. For example, the 5' ends 
of a decameric linker and a DNA molecule are phospho- 
rylated by polynucleotide kinase and then joined by the 
ligase from T4 phage (Figure 6.12). This ligase can form 
a covalent bond between blunt-ended (flush-ended) dou- 
ble-helical DNA molecules. Cohesive ends are produced 
when these terminal extensions are cut by an appropriate 
restriction enzyme. Thus, cohesive ends corresponding to 
a particular restriction enzyme can be added to virtually 
any DNA molecule. We see here the fruits of combining 
enzymatic and synthetic chemical approaches in crafting 
new DNA molecules. 
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6.2.2 Plasmids and Lambda Phage Are Choice Vectors 
for DNA Cloning in Bactéria 

Many plasmids and bacteriophages have been ingeniously modified to en- 
hance the delivery of recombinant DNA molecules into bactéria and to fa- 
cilitate the selection of bactéria harboring these vectors. Plasmids are cir¬ 
cular duplex DNA molecules occurring naturally in some bactéria and 
ranging in size from 2 to several hundred kilobases. They carry genes for 
the inactivation of antibiotics, the production of toxins, and the breakdown 
of natural products. These accessory chromosomes can replicate independ- 
ently of the host chromosome. In contrast with the host genome, they are 
dispensable under certain conditions. A bacterial cell may have no plasmids 
at all or it may house as many as 20 copies of a plasmid. 

pBR322 Plasmid. One of the most useful plasmids for cloning is pBR322, 
which contains genes for resistance to tetracycline and ampicillin (an an- 
tibiotic like penicillin). Different endonucleases can cleave this plasmid at 
a variety of unique sites, at which DNA fragments can be inserted. Inser- 
tion of DNA at the EcoKl restriction site does not alter either of the genes 
for antibiotic resistance (Figure 6.13). However, insertion at the fímdlll, 
Sa/I, or BamHl restriction site inactivates the gene for tetracycline resist¬ 
ance, an effect called insertional inactivation. Cells containing pBR322 with 
a DNA insert at one of these restriction sites are resistant to ampicillin but 
sensitive to tetracycline, and so they can be readily selected. Cells that failed 
to take up the vector are sensitive to both antibiotics, whereas cells con¬ 
taining pBR322 without a DNA insert are resistant to both. 

Lambda (A) Phage. Another widely used vector, A phage, enjoys a choice 
of life styles: this bacteriophage can destroy its host or it can become part 
of its host (Figure 6.14). In the lytic pathway, virai functions are fully ex- 
pressed: virai DNA and proteins are quickly produced and packaged into 
virus particles, leading to the lysis (destruction) of the host cell and the sud- 
den appearance of about 100 progeny virus particles, or virions. In the lyso- 
genic pathway, the phage DNA becomes inserted into the host-cell genome 
and can be replicated together with host-cell DNA for many generations, 
remaining inactive. Certain environmental changes can trigger the expres- 
sion of this dormant virai DNA, which leads to the formation of progeny 
virus and lysis of the host. Large segments of the 48-kb DNA of A phage 
are not essential for productive infection and can be replaced by foreign 
DNA, thus making A phage an ideal vector. 
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FIGURE 6.13 Genetic map of the 
plasmid pBR322. This plasmid carries 
two genes for antibiotic resistance. Like all 
other plasmids, it is a circular duplex DNA. 


FIGURE 6.14 Alternative infection 
modes for A phage. Lambda phage can 
multiply within a host and lyse it (lytic 
pathway), or its DNA can become 
integrated into the host genome 
(lysogenic pathway), where it is dormant 
until activated. 
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XDNA 


FIGURE 6.15 Mutant A phage as a 
doníng vector. The packaging process 
selects DNA molecules that contain an 
insert. 



FIGURE 6.16 Electron mícrograph of 
Ml3 filamentous phage. [Courtesy of Dr. 
Robley Williams.] 
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Mutant A phages designed for cloning have been constructed. An espe- 
cially useful one called Agt-Aj8 contains only two EcoKl cleavage sites in- 
stead of the five normally present (Figure 6.15). After cleavage, the middie 
segment of this A DNA molecule can be removed. The two remaining pieces 
of DNA (called arms) have a combined length equal to 72% of a normal 
genome length. This amount of DNA is too little to be packaged into a A 
particle, because only DNA measuring from 75% to 105% of a normal 
genome in length can be readily packaged. However, a suitably long DNA 
insert (such as 10 kb) between the two ends of A DNA enables such a recom¬ 
binant DNA molecule (93% of normal length) to be packaged. Nearly all in¬ 
fective A particles formed in this way will contain an inserted piece of for¬ 
eign DNA. Another advantage of using these modified viruses as vectors is 
that they enter bactéria much more easily than do plasmids. Among the va- 
riety of A mutants that have been constructed for use as cloning vectors, one 
of them, called a cosmid, is essentially a hybrid of A phage and a plasmid 
that can serve as a vector for large DNA inserts (as large as 45 kb). 

M13 Phage. Another very useful vector for cloning DNA, Ml 3 phage is es- 
pecially useful for sequencing the inserted DNA. This filamentous virus is 
900 nm long and only 9 nm wide (Figure 6.16). Its 6.4-kb single-stranded cir- 
cle of DNA is protected by a coat of 2710 identical protein subunits. Ml 3 en¬ 
ter s E. coli through the bacterial sex pilus, a protein appendage that permits 
the transfer of DNA between bactéria. The single-stranded DNA in the virus 
particle [called the ( + ) strand] is replicated through an intermediate circu¬ 
lar double-stranded replicative form (RF) containing ( + ) and ( — ) strands. 
Only the ( + ) strand is packaged into new virus particles. About a thousand 
progeny Ml 3 are produced per generation. A striking feature of M13 is that 
it does not kill its bacterial host. Consequently, large quantities of M13 can 
be grown and easily harvested (1 gram from 10 liters of culture fluid). 

An Ml3 vector is prepared for cloning by cutting its circular RF DNA 
at a single site with a restriction enzyme. The cut is made in a polylinker re- 
gion that contains a series of closely spaced recognition sites for restriction 
enzymes; only one of each such sites is present in the vector. A double- 
stranded foreign DNA fragment produced by cleavage with the same re¬ 
striction enzyme is then ligated to the cut vector (Figure 6.17). The foreign 
DNA can be inserted in two different orientations because the ends of both 























DNA molecules are the same. Hence, half the new ( + ) strands packaged 
into virus particles will contain one of the strands of the foreign DNA, and 
half will contain the other strand. Infection of E. coli by a single virus parti- 
cie will yield a large amount of single-stranded Ml3 DNA containing the 
same strand of the foreign DNA. DNA cloned into Ml3 can be easily se- 
quenced. An oligonucleotide that hybridizes adjacent to the polylinker re- 
gion is used as a primer for sequencing the insert. This oligomer is called a 
universal sequencing primer because it can be used to sequence any insert. Ml 3 
is ideal for sequencing but not for long-term propagation of recombinant 
DNA, because inserts longer than about 1 kb are not stably maintained. 

6.2.3 Specific Genes Can Be Cloned from Digests 
of Genomic DNA 

Ingenious cloning and selection methods have made feasible the isolation of 
a specific DNA segment several kilobases long out of a genome containing 
more than 3 X 10^ kb. Let us see how a gene that is present just once in a hu- 
man genome can be cloned. A sample containing many molecules of total ge¬ 
nomic DNA is first mechanically sheared or partly digested by restriction en- 
zymes into large fragments (Figure 6.18). This nearly random population of 
overlapping DNA fragments is then separated by gel electrophoresis to iso- 
late a set about 15 kb long. Synthetic linkers are attached to the ends of these 
fragments, cohesive ends are formed, and the fragments are then inserted into 
a vector, such as A phage DNA, prepared with the same cohesive ends. E. coli 
bactéria are then infected with these recombinant phages. The resulting lysate 
contains fragments of human DNA housed in a sufficiently large number of 
virus particles to ensure that nearly the entire genome is represented. These 
phages constitute a genomic library. Phages can be propagated indefinitely, 
and so the library can be used repeatedly over long periods. 

This genomic library is then screened to find the very small portion of 
phages harboring the gene of interest. For the human genome, a calculation 
shows that a 99% probability of success requires screening about 500,000 
clones; hence, a very rapid and efficient screening process is essential. Rapid 
screening can be accomplished by DNA hybridization. 
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FIGURE 6.18 Creation of a genomic 
library. A genomic library can be created 
from a digest of a whole eukaryotic 
genome. 
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FIGURE 6.17 Ml3 phage DNA, a 
cloning and sequencing vector. Ml3 

phage DNA is very usefui in sequencing 
DNA fragments by the dideoxy method. A 
double-stranded DNA fragment is inserted 
into M13 RF DNA. Synthesis of new strand 
is primed by an oligonucleotide that is 
complementary to a sequence near the 
inserted DNA. 
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FIGURE 6.19 Screeníng a genomíc 
library for a specífíc gene. Here, a 
plate is tested from plaques containing 
gene a of Figure 6.18. 


A dilute suspension of the recombinant phages is first plated on a lawn 
of bactéria (Figure 6.19). Where each phage particle has landed and infected 
a bacterium, aplaque containing identical phages develops on the plate. A 
replica of this master plate is then made by applying a sheet of nitrocellu- 
lose. Infected bactéria and phage DNA released from lysed cells adhere to 
the sheet in a pattern of spots corresponding to the plaques. Intact bactéria 
on this sheet are lysed with NaOH, which also serves to denature the DNA 
so that it becomes accessible for hybridization with a ^^P-labeled probe. The 
presence of a specxfic DNA sequence in a single spot on the replica can be de- 
tected by using a radioactive complementary DNA or RN A molecule as a probe. 
Autoradiography then reveals the positions of spots harboring recombinant 
DNA. The corresponding plaques are picked out of the intact master plate 
and grown. A single investigator can readily screen a million clones in a day. 

This method makes it possible to isolate virtually any gene, provided that 
a probe is available. How does one obtain a specific probe? One approach 
is to start with the corresponding mRNA from cells in which it is abundant. 
For example, precursors of red blood cells contain large amounts of mRNA 
for hemoglobin, and plasma cells are rich in mRNAs for antibody mole- 
cules. The mRNAs from these cells can be fractionated by size to enrich for 
the one of interest. As will be described shortly, a DNA complementary to 
this mRNA can be synthesized in vitro and cloned to produce a highly spe¬ 
cific probe. 

Alternatively, a probe for a gene can be prepared if part of the amino acid 
sequence of the protein encoded by the gene is known. A problem arises be- 
cause a given peptide sequence can be encoded by a number of oligonu- 
cleotides (Figure 6.20). Thus, for this purpose, peptide sequences contain¬ 
ing tryptophan and methionine are preferred, because these amino acids are 
specified by a single codon, whereas other amino acid residues have between 
two and six codons (Section 5.5.1). 
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FIGURE 6.20 Probes generated from a protein sequence. A probe can be generated 
by synthesizing all possible oligonucleotides encoding a particular sequence of amino acids. 
Because of the degeneracy of the genetic code, 256 distinct oligonucleotides must be 
synthesized to ensure that the probe matching the sequence of seven amino acids is present. 

All the DNA sequences (or their complements) that encode the selected 
peptide sequence are synthesized by the solid-phase method and made ra¬ 
dioactive by phosphorylating their 5' ends with from [^^P]-ATP. The 
replica plate is exposed to a mixture containing all these probes and au- 
toradiographed to identify clones with a complementary DNA sequence. 
Positive clones are then sequenced to determine which ones have a sequence 
matching that of the protein of interest. Some of them may contain the de- 
sired gene or a significant segment of it. 

6.2.4 Long Stretches of DNA Can Be Efficientiy 
Analyzed by Chromosome Walking 

A typical genomic DNA library housed in A phage vectors consists of DNA 
fragments about 15 kb long. However, many eukaryotic genes are much 
longer—for example, the dystrophin gene, which is mutated in Duchenne 
muscular dystrophy, is 2000 kb long. How can such long stretches of DNA 
be analyzed? The development of cosmids helped because these chimeras 





of plasmids and A phages can house 45-kb inserts. Much larger pieces of 
DNA can now be propagated in bacterial artificial chromosomes (BACs) or 
yeast artificial chromosomes (YACs). YACs contain a centromere, an aut- 
onomous replication sequence (ARS, where replication begins), a pair of telom- 
eres (normal ends of eukaryotic chromosomes), selectable marker genes, and 
a cloning site (Figure 6.21). Genomic DNA is partly digested by a restric- 
tion endonuclease that cuts, on the average, at distant sites. The fragments 
are then separated by pulsed-field gel electrophoresis, and the large ones ( ~ 
450 kb) are eluted and ligated into YACs. Artificial chromosomes bearing 
inserts ranging from 100 to 1000 kb are efficiently replicated in yeast cells. 

Equally important in analyzing large genes is the capacity to scan long 
regions of DNA. The principie technique for this purpose makes use of 
overlaps in the library fragments. The fragments in a cosmid or YAC library 
are produced by random cleavage of many DNA molecules, and so some of 
the fragments overlap one another. Suppose that a fragment containing re- 
gion A selected by hybridization with a complementary probe A' also con- 
tains region B (Figure 6.22). A new probe B' can be prepared by cleaving 
this fragment between regions A and B and subcloning region B. If the li¬ 
brary is screened again with probe B', new fragments containing region B 
will be found. Some will contain a previously unknown region C. Hence, 
we now have information about a segment of DNA encompassing regions 
A, B, and C. This process of subcloning and rescreening is called chromo- 
some walking. Long stretches of DNA can be analyzed in this way, provided 
that each of the new probes is complementary to a unique region. 
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6.3 MANIPULATING THE GENES OF EUKARYOTES 


Eukaryotic genes, in a simplified form, can be introduced into bactéria, and 
the bactéria can be used as factories to produce a desired protein product. 
It is also possible to introduce DNA into higher organisms. In regard to an¬ 
imais, this ability provides a valuable tool for examining gene action, and it 
will be the basis of gene therapy. In regard to plants, introduced genes may 
make a plant resistant to pests or capable of growing in harsh conditions or 
able to carry greater quantities of essential nutrients. The manipulation of 
eukaryotic genes holds much promise for medicai and agricultural benefits, 
but it is also the source of controversy. 

6.3.1 Complementary DNA Prepared from mRNA 
Can Be Expressed in Host Cells 

How can mammalian DNA be cloned and expressed by E. coli? Recall that 
most mammalian genes are mosaics of introns and exons (Section 5.6). These 
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FIGURE 6.21 Diagram of a yeast 
artificial chromosome (YAC). DNA 

inserts as large as 1000 kb can be 
propagated in this vector. 


FIGURE 6.22 Chromosome walking. 

Long regions of unknown DNA can be 
explored, starting with a known base 
sequence, by subcloning and rescreening. 
New probes are designed on the basis of 
the DNA sequences that have been 
determined. 
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FIGURE 6.23 Synthesís of proinsulin 
by bactéria. Proinsulin, a precursor of 
insulin, can be synthesized by transformed 
(genetically altered) clones of E. coli. The 
clones contain the mammalian proinsulin 
gene. 


interrupted genes cannot be expressed by bactéria, which lack the machin- 
ery to splice introns out of the primary transcript. However, this difficulty 
can be circumvented by causing bactéria to take up recombinant DNA that 
is complementary to mRNA. For example, proinsulin, a precursor of in¬ 
sulin, is synthesized by bactéria harboring plasmids that contain DNA com¬ 
plementary to mRNA for proinsulin (Figure 6.23). Indeed, bactéria pro- 
duce much of the insulin used today by millions of diabetics. 

The key to forming complementary DNA (cDNA) is the enzyme rever se 
transcriptase. As discussed in Section 5.3.1, a retrovirus uses this enzyme to 
form a DNA-RNA hybrid in replicating its genomic RNA. Reverse tran¬ 
scriptase synthesizes a DNA strand complementary to an RNA template if 
it is provided with a DNA primer that is base-paired to the RNA and con- 
tains a free 3'-OH group. We can use a simple sequence of linked thymi- 
dine [oligo(T)] residues as the primer. This oligo(T) sequence pairs with 
the poly(A) sequence at the 3' end of most eukaryotic mRNA molecules 
(Section 5.4.4), as shown in Figure 6.24. The reverse transcriptase then syn¬ 
thesizes the rest of the cDNA strand in the presence of the four deoxyri- 
bonucleoside triphosphates. The RNA strand of this RNA-DNA hybrid is 
subsequently hydrolyzed by raising the pH. Unlike RNA, DNA is resis- 
tant to alkaline hydrolysis. The single-stranded DNA is converted into 
double-stranded DNA by creating another primer site. The enzyme term¬ 
inal transferase adds nucleotides—for instance, several residues of dG—to 
the 3' end of DNA. Oligo(dC) can bind to dG residues and prime the syn- 
thesis of the second DNA strand. Synthetic linkers can be added to this 
double-helical DNA for ligation to a suitable vector. Complementary DNA 
for all mRNA that a cell contains can be made, inserted into vectors, and 
then inserted into bactéria. Such a collection is called a cDNA library. 
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FIGURE 6.24 Formation of a cDNA duplex. A cDNA duplex is created from mRNA by 
using reverse transcriptase to synthesize a cDNA strand, first along the mRNA template and 
then, after digestion of the mRNA, along that same newly synthesized cDNA strand. 


















Complementary DNA molecules can be inserted into vectors that favor 
their efficient expression in hosts such as E. coli. Such plasmids or phages are 
called expression vectors. To maximize transcription, the cDNA is inserted into 
the vector in the correct reading frame near a strong bacterial promoter site. 
In addition, these vectors ensure efficient translation by encoding a ribosome- 
binding site on the mRNA near the initiation codon. Clones of cDNA can be 
screened on the basis of their capacity to direct the synthesis of a foreign pro- 
tein in bactéria. A radioactive antibody specific for the protein of interest can 
be used to identify colonies of bactéria that harbor the corresponding cDNA 
vector (Figure 6.25). As described in Section 6.2.3, spots of bactéria on a 
replica plate are lysed to release proteins, which bind to an applied nitrocel- 
lulose filter. A ^^^I-labeled antibody specific for the protein of interest is 
added, and autoradiography reveals the location of the desired colonies on the 
master plate. This immunochemical screening approach can be used when- 
ever a protein is expressed and corresponding antibody is available. 


6.3.2 Gene-Expression Leveis Can Be Comprehensively 
Examined 

Most genes are present in the same quantity in every cell—namely, one copy 
per haploid cell or two copies per diploid cell. However, the levei at which a 
gene is expressed, as indicated by mRNA quantities, can vary widely, rang- 
ing from no expression to hundreds of mRNA copies per cell. Gene-expres¬ 
sion patterns vary from cell type to cell type, distinguishing, for example, a 
muscle cell from a nerve cell. Even within the same cell, gene-expression lev¬ 
eis may vary as the cell responds to changes in physiological circumstances. 

Using our knowledge of complete genome sequences, it is now possible 
to analyze the pattern and levei of expression of all genes in a particular cell 
or tissue. One of the most powerful methods developed to date for this pur- 
pose is based on hybridization. High-density arrays of oligonucleotides, called 
DNA microarrays or gene chips, can be constructed either through light- 
directed chemical synthesis carried out with photolithographic microfabrica- 
tion techniques used in the semiconductor industry or by placing very small 
dots of oligonucleotides or cDNAs on a solid support such as a microscope slide. 
Fluorescently labeled cDNA is hybridized to the chip to reveal the expression 
levei for each gene, identifiable by its known location on the chip. (Figure 6.26). 


Urffiemt lumon 



FIGURE 6.26 Gene expression 
analysis using microarrays. The 

expression leveis of thousands of genes 
can be simultaneously analyzed using DNA 
microarrays (gene chips). Here, analysis of 
1733 genes in 84 breast tumor samples 
reveals that the tumors can be divided 
into distinct classes based on their gene 
expression patterns. Red corresponds to 
gene induction and green corresponds to 
gene repression. [Adapted from C. M. Perou et 
al., Nature 406(2000);747.] 
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FIGURE 6.25 Screening of cDNA 
clones. A method of screening for cDNA 
clones is to identify expressed products by 
staining with specific antibody. 
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The intensity of the fluorescent spot on the chip reveals the extent of tran- 
scription of a particular gene. DNA chips have been prepared that contain 
oligonucleotides complementary to all known open reading frames, 6200 
in number, within the yeast genome (Figure 6.27). An analysis of mRNA 
pools with the use of these chips revealed, for example, that approximately 
50% of all yeast genes are expressed at steady-state leveis of between 0.1 
and 1.0 mRNA copy per cell. This method readily detected variations in 
expression leveis displayed by specific genes under different growth con- 
ditions. These tools will continue to grow in power as genome sequencing 
efforts continue. 


FIGURE 6.27 Monitoring changes in 
yeast gene expression. This microarray 
analysis shows leveis of gene expression 
for yeast genes under different conditions. 
[Adapted from lyer et al., Nature 
409(2000):533.] 
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FIGURE 6.28 Microinjection of DNA. Cloned plasmid 
DNA is being microinjected into the male pronudeus of a 
fertilized mouse egg. 


6.3.3 New Genes Inserted into Eukaryotic Cells 
Can Be Efficientiy Expressed 

Bactéria are ideal hosts for the amplification of DNA molecules. They can 
also serve as factories for the production of a wide range of prokaryotic and 
eukaryotic proteins. However, bactéria lack the necessary enzymes to carry 
out posttranslational modifications such as the specific cleavage of polypep- 
tides and the attachment of carbohydrate units. Thus, many eukaryotic 
genes can be correctly expressed only in eukaryotic host cells. The intro- 
duction of recombinant DNA molecules into cells of higher organisms can 
also be a source of insight into how their genes are organized and expressed. 
How are genes turned on and off in embryological development? How does 
a fertilized egg give rise to an organism with highly differentiated cells that 
are organized in space and time? These central questions of biology can now 
be fruitfully approached by expressing foreign genes in mammalian cells. 

Recombinant DNA molecules can be introduced into animal cells in sev- 
eral ways. In one method, foreign DNA molecules precipitated by calcium 
phosphate are taken up by animal cells. A small fraction of the imported 
DNA becomes stably integrated into the chromosomal DNA. The effi- 
ciency of incorporation is low, but the method is useful because it is easy to 
apply. In another method, DNA is microinjected into cells. A fine-tipped 
(O.l-jüLm-diameter) glass micropipet containing a solution of foreign DNA 
is inserted into a nucleus (Figure 6.28). A skilled investigator can inject hun- 
dreds of cells per hour. About 2% of injected mouse cells are viable and con¬ 
tain the new gene. In a third method, viruses are used to bring new genes 
into animal cells. The most effective vectors are retroviruses. As discussed 
in Section 5.3.1, retroviruses replicate through DNA intermediates, the re¬ 
ver se of the normal flow of Information. A striking feature 
of the life cycle of a retrovirus is that the double-helical DNA 
form of its genome, produced by the action of rever se tran- 
scriptase, becomes randomly incorporated into host chro¬ 
mosomal DNA. This DNA version of the virai genome, 
called profira/ DNA, can be efficiently expressed by the host 
cell and replicated along with normal cellular DNA. Retro¬ 
viruses do not usually kill their hosts. Foreign genes have 
been efficiently introduced into mammalian cells by infect- 
ing them with vectors derived from Moloney murine leukemia 
virus, which can accept inserts as long as 6 kb. Some genes 
introduced by this retroviral vector into the genome of a 
transformed host cell are efficiently expressed. 

Two other virai vectors are extensively used. Vaccinia 
virus, a large DNA-containing virus, replicates in the cyto- 
plasm of mammalian cells, where it shuts down host-cell pro- 
tein synthesis. Baculovirus infects insect cells, which can be 
conveniently cultured. Insect larvae infected with this virus 









can serve as efficient protein factories. Vectors based on these large-genome 
viruses have been engineered to express DNA inserts efficiently. 

6.3.4 Transgenic Animais Harbor and Express Genes 
That Were Introduced into Their Germ Lines 

Genetically engineered giant mice illustrate the expression of foreign genes 
in mammalian cells (Figure 6.29). Giant mice were produced by introduc- 
ing the gene for rat growth hormone into a fertilized mouse egg. Growth hor- 
mone (somatotropin), a 21-kd protein, is normally synthesized by the pitu- 
itary gland. A deficiency of this hormone produces dwarfism, and an excess 
leads to gigantism. The gene for rat growth hormone was placed on a plas- 
mid next to the mouse metallothionein promoter (Figure 6.30). This pro- 
moter site is normally located on a chromosome, where it Controls the tran- 
scription of metallothionein, a cysteine-rich protein that has high affinity for 
heavy metais. Metallothionein binds to and sequesters heavy metais, many 
of which are toxic for metabolic processes (Section 17.3.2). The synthesis of 
this protective protein by the li ver is induced by heavy-metal ions such as 
cadmium. Hence, if mice contain the new gene, its expression can be initi- 
ated by the addition of cadmium to the drinking water. 



FIGURE 6.29 Transgenic mice. Injection 
of the gene for growth hormone into a 
fertilized mouse egg gave rise to a giant 
mouse (left), about twice the weight of his 
silbiing (right). [Courtesy of Dr. Ralph Brinster.] 
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FIGURE 6.30 Rat growth hormone- 
metallothionein gene construct. The 

gene for rat growth hormone (shown in 
yellow) was inserted into a plasmid next 
to the metallothionein promoter, which is 
activated by the addition of heavy metais, 
such as cadmium ion. 


Several hundred copies of the plasmid containing the promoter and 
growth-hormone gene were microinjected into the male pronucleus of a fer¬ 
tilized mouse egg, which was then inserted into the uterus of a foster mother 
mouse. A number of mice that developed from such microinjected eggs con- 
tained the gene for rat growth hormone, as shown by Southern blots of their 
DNA. These transgenic mice, containing multiple copies ( ~ 30 per cell) of 
the rat growth-hormone gene, grew much more rapidly than did control 
mice. In the presence of cadmium, the levei of growth hormone in these 
mice was 500 times as high as in normal mice, and their body weight at ma- 
turity was twice normal. The foreign DNA had been transcribed and its five 
introns correctly spliced out to form functional mRNA. These experiments 
strikingly demonstrate that a foreign gene under the control of a new promoter 
site can be integrated and efficiently expressed in mammalian cells. 
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FIGURE 6.31 Gene dísruptíon by 
homologous recombinatíon. (A) A 

mutated version of the gene to be disrupted 
is constructed, maintaining some regions 
of homology with the normal gene (red). 
When the foreign mutated gene is 
introduced into an embryonic stem cell, 

(B) recombination takes place at regions 
of homology and (C) the normal 
(targeted) gene is replaced, or "knocked 
out," by the foreign gene The cell is 
inserted into embryos, and mice lacking 
the gene (knockout mice) are produced. 



-Y- 

Mutated gene 


(B) 


A A 

Homologous recombination 


(C) 


] 


Mutation in the targeted gene 


6.3.5 Gene Disruption Provides Clues to Gene Function 

A gene’s function can also be probed by inactivating the gene and looking 
for resulting abnormalities. Powerful methods have been developed for ac- 
complishing gene disruption (also called gene knockout) in organisms such as 
yeast and mice. These methods rely on the process of homologous recombi¬ 
nation. Through this process, regions of strong sequence similarity exchange 
segments of DNA. Foreign DNA inserted into a cell thus can disrupt any 
gene that is at least in part homologous by exchanging segments (Figure 
6.31). Specific genes can be targeted if their nucleotide sequences are known. 

For example, the gene knockout approach has been applied to the genes 
encoding gene regulatory proteins (also called transcription factors) that 
control the differentiation of muscle cells. When both copies of the gene for 
the regulatory protein myogenin are disrupted, an animal dies at birth be- 
cause it lacks functional skeletal muscle. Microscopic inspection reveals that 
the tissues from which muscle normally forms contain precursor cells that 
have failed to differentiate fully (Figure 6.32). Heterozygous mice contain- 
ing one normal myogenin gene and one disrupted gene appear normal, in- 
dicating that the levei of gene expression is not essential for its function. 
Analogous studies have probed the function of many other genes to gener- 
ate animal models for known human genetic diseases. 



FIGURE 6.32 Consequences of gene disruption. Sections of muscle from normal 
(A) and gene-disrupted (B) mice, as viewed under the light microscope. Muscles do not 
develop properly in mice having both myogenin genes disrupted. [From P. Hasty, A. Bradiey, 
J. H. Morris, D. G. Edmondson, J. M. Venuti, E. N. Olson, and W. H. Klein, Nature 364(1993):501.] 














































6.3.6 Tumor-Indudng Plasmids Can Be Used 
to Introduce New Genes into Plant Cells 

The common soil bacterium Agrobacterium tumefaciens infects plants and 
introduces foreign genes into plants cells (Figure 6.33). A lump of tumor 
tissue called a crown gall grows at the site of infection. Crown galls synthe- 
size opines, a group of amino acid derivatives that are metabolized by the 
infecting bactéria. In essence, the metabolism of the plant cell is diverted to 
satisfy the highly distinctive appetite of the intruder. Tumor-inducing plas¬ 
mids (Ti plasmids) that are carried by Agrobacterium carry instructions for 
the switch to the tumor State and the synthesis of opines. A small part of 
the Ti plasmid becomes integrated into the genome of infected plant cells; 
this 20-kb segment is called T-DNA (transferred DNA; Figure 6.34). 


FIGURE 6.34 Ti plasmids. Agrobacteria 
containing Ti plasmids can deliver foreign 
genes into some plant cells. [After M. Chilton. 
A vector for introducing new genes into plants. 
Copyright ©1983 by Scientific American, Inc. 

All rights reserved.] 
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Ti plasmid derivatives can be used as vectors to deliver foreign genes 
into plant cells. First, a segment of foreign DNA is inserted into the T-DNA 
region of a small plasmid through the use of restriction enzymes and lig- 
ases. This synthetic plasmid is added to Agrobacterium colonies harboring 
naturally occurring Ti plasmids. By recombination. Ti plasmids containing 
the foreign gene are formed. These Ti vectors hold great promise as tools 
for exploring the genomes of plant cells and modifying plants to improve 
their agricultural value and crop yield. However, they are not suitable for 
transforming all types of plants. Ti-plasmid transfer is effective with dicots 
(broad-leaved plants such as grapes) and a few kinds of monocots but not 
with economically important cereal monocots. 

Foreign DNA can be introduced into cereal monocots as well as dicots 
by applying intense electric fields, a technique called electroporation (Figure 
6.35). First, the cellulose wall surrounding plant cells is removed by adding 
cellulase; this treatment produces protoplasts, plant cells with exposed 
plasma membranes. Electric pulses are then applied to a suspension of pro- 
toplasts and plasmid DNA. Because high electric fields make membranes 
transiently permeable to large molecules, plasmid DNA molecules enter the 
cells. The cell wall is then allowed to reform, and the plant cells are again 
viable. Maize cells and carrot cells have been stably transformed in this way 
with the use of plasmid DNA that includes genes for resistance to antibi- 
otics. Moreover, the transformed cells efficiently express the plasmid DNA. 
Electroporation is also an effective means of delivering foreign DNA into 
animal cells. 

The most effective means of transforming plant cells is through the use 
of ‘‘gene guns,'' or bombardment-mediated transformation. DNA is coated 
onto 1-jam-diameter tungsten pellets, and these microprojectiles are fired 



FIGURE 6.33 Tumors ín plants. 

Crown gall, a plant tumor, is caused by a 
bacterium (Agrobacterium tumefaciens) 
that carries a tumor-inducing plasmid 
(Ti plasmid). 
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FIGURE 6.35 Electroporation. Foreign 
DNA can be introduced into plant cells by 
electroporation, the appiication of intense 
electric fields to make their plasma 
membranes transiently permeable. 






















at the target cells with a velocity greater than 400 m Despite its appar- 
ent crudeness, this technique is proving to be the most effective way of trans- 
forming plants, especially important crop species such as soybean, corn, 
wheat, and rice. The gene-gun technique affords an opportunity to develop 
genetically modified organisms (GMOs) with beneficiai characteristics. 
Such characteristics could include the ability to grow in poor soils, resist- 
ance to natural climatic variation, resistance to pests, and nutritional forti- 
fication. These crops might be most useful in developing countries. The use 
of genetically modified organisms is highly controversial at this point be- 
cause of fears of unexpected side effects. 

The first GMO to come to market was a tomato characterized by de- 
layed ripening, rendering it ideal for shipment. Pectin is a polysaccharide 
that gives tomatoes their firmness and is naturally destroyed by the en- 
zymepolygalacturonase. As pectin is destroyed, the tomatoes soften, mak- 
ing shipment difficult. DNA was introduced that disrupts the poly- 
galacturonase gene. Less of the enzyme was produced, and the tomatoes 
stayed fresh longer. However, the tomatoes poor taste hindered its com- 
mercial success. 


6.4 NOVEL PROTEINS CAN BE ENGINEERED 
BY SITE-SPECIFIC MUTAGENESIS 

Much has been learned about genes and proteins by analyzing mutated 
genes selected from the repertoire offered by nature. In the classic genetic 
approach, mutations are generated randomly throughout the genome, and 
those exhibiting a particular phenotype are selected. Analysis of these mu- 
tants then reveals which genes are altered, and DNA sequencing identifies 
the precise nature of the changes. Recombinant DNA technology novo makes 
it feasible to create specific mutations in vitro. 

6.4.1 Proteins with New Functions Can Be Created 
Through Directed Changes in DNA 

We can construct new genes with designed properties by making three kinds 
of directed changes: deletions, insertions, and substitutions. 

Deletions. A specific deletion can be produced by cleaving a plasmid at 
two sites with a restriction enzyme and ligating to form a smaller circle. This 
simple approach usually removes a large block of DNA. A smaller deletion 
can be made by cutting a plasmid at a single site. The ends of the linear 
DNA are then digested with an exonuclease that removes nucleotides from 
both strands. The shortened piece of DNA is then ligated to form a circle 
that is missing a short length of DNA about the restriction site. 
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FIGURE 6.36 Oligonudeotíde-directed mutagenesís. A primer containing a 
mismatched nucleotide is used to produce a desired change in the DNA sequence. 





Substitutions: Oligonucleotide-Directed Mutagenesis. Mutant pro- 
teins with single amino acid substitutions can be readily produced by 
oligonucleotide-directed mutagenesis (Figure 6.36). Suppose that we want to 
replace a particular serine residue with cysteine. This mutation can be made 
if (1) we have a plasmid containing the gene or cDNA for the protein and 
(2) we know the base sequence around the site to be altered. If the serine of 
interest is encoded by TCT, we need to change the C to a G to get cysteine, 
which is encoded by TGT. This type of mutation is called a point mutation 
because only one base is altered. The key to this mutation is to prepare an 
oligonucleotide primer that is complementary to this region of the gene ex- 
cept that it contains TGT instead of TCT. The two strands of the plasmid 
are separated, and the primer is then annealed to the complementary strand. 
The mismatch of 1 base pair of 15 is tolerable if the annealing is carried out 
at an appropriate temperature. After annealing to the complementary 
strand, the primer is elongated by DNA polymerase, and the double- 
stranded circle is closed by adding DNA ligase. Subsequent replication of 
this duplex yields two kinds of progeny plasmid, half with the original TCT 
sequence and half with the mutant TGT sequence. Expression of the plas¬ 
mid containing the new TGT sequence will produce a protein with the de- 
sired substitution of serine for cysteine at a unique site. We will encounter 
many examples of the use of oligonucleotide-directed mutagenesis to pre- 
cisely alter regulatory regions of genes and to produce proteins with tailor- 
made features. 

Insertions: Cassette Mutagenesis, In another valuable approach, cassette 
mutagenesis, plasmid DNA is cut with a pair of restriction enzymes to re¬ 
move a short segment (Figure 6.37). A synthetic double-stranded oligonu¬ 
cleotide (the cassette) with cohesive ends that are complementary to the ends 
of the cut plasmid is then added and ligated. Each plasmid now contains 
the desired mutation. It is convenient to introduce into the plasmid unique 
restriction sites spaced about 40 nucleotides apart so that mutations can be 
readily made anywhere in the sequence. 

Designer Genes. Novel proteins can also be created by splicing together 
gene segments that encode domains that are not associated in nature. For 
example, a gene for an antibody can be joined to a gene for a toxin to pro¬ 
duce a chimeric protein that kills cells that are recognized by the antibody. 
These immunotoxins are being evaluated as anticancer agents. Entirely new 
genes can be synthesized de novo by the solid-phase method. Furthermore, 
noninfectious coat proteins of viruses can be produced in large amounts by 
recombinant DNA methods. They can serve as synthetic vaccines that are 
safer than conventional vaccines prepared by inactivating pathogenic 
viruses. A subunit of the hepatitis B virus produced in yeast is proving to 
be an effective vaccine against this debilitating virai disease. 

6.4.2 Recombinant DNA Technology Has Opened New Vistas 

Recombinant DNA technology has revolutionized the analysis of the mo¬ 
lecular basis of life. Complex chromosomes are being rapidly mapped and 
dissected into units that can be manipulated and deciphered. The amplifi- 
cation of genes by cloning has provided abundant quantities of DNA for 
sequencing. Genes are now open books that can be read. New insights are 
emerging, as exemplified by the discovery of introns in eukaryotic genes. 
Central questions of biology, such as the molecular basis of development, 
are now being fruitfully explored. DNA and RN A sequences provide a 
wealth of information about evolution. Biochemists now move back and 
forth between gene and protein and feel at home in both areas of inquiry. 
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FIGURE 6.37 Cassette mutagenesis. 

DNA is deaved at a pair of unique 
restriction sites by two different restriction 
endonuclease. A synthetic oligonucleotide 
with ends that are complementary to these 
sites (the cassette) is then ligated to the 
deaved DNA. The method is highly versatile 
because the inserted DNA can have any 
desired sequence. 
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FIGURE 6.38 The techniques of protein 
chemistry and nucleic acid chemistry 
are mutually reinforcing. (A) From DNA 
(or RNA) to protein and (B) from protein 
to DNA. 


Analyses of genes and cDNA can reveal the existence of previously un- 
known proteins, which can be isolated and purified (Figure 6.38A). Conversely, 
purification of a protein can be the starting point for the isolation and cloning 
of its gene or cDNA (Figure 6.39B). Very small amounts of protein or nucleic 
acid suffice because of the sensitivity of recently developed microchemical 
techniques and the amplification afforded by gene cloning and the polymerase 
chain reaction. The powerful techniques of protein chemistry, nucleic acid 
chemistry, immunology, and molecular genetics are highly synergistic. 

New kinds of proteins can be created by altering genes in specific ways. 
Site-specific mutagenesis opens the door to understanding how proteins 
fold, recognize other molecules, catalyze reactions, and process information. 
Large amounts of protein can be obtained by expressing cloned genes or 
cDNAs in bactéria or eukaryotic cells. Hormones, such as insulin, and an- 
tiviral agents, such as interferon, are being produced by bactéria. Tissue plas- 
minogen activator, which is administered to a patient after a heart attack, is 
made in large quantities in mammalian cells. A new pharmacology, using pro¬ 
teins produced by recombinant DNA technology as drugs, is beginning to 
significantly alter the practice of medicine. Recombinant DNA technology is 
also providing highly specific diagnostic reagents, such as DNA probes for 
the detection of genetic diseases, infections, and câncers. Human gene ther- 
apy has been successfully initiated. White blood cells deficient in adenosine 
deaminase, an essential enzyme, are taken from patients and returned after 
being transformed in vitro to correct the genetic error. Agriculture, too, is 
benefiting from genetic engineering. Transgenic crops with increased resist- 
ance to insects, herbicides, and drought have been produced. 


SUMMARY 


O The Basic Tools of Gene Exploration 

The recombinant DNA revolution in biology is rooted in the repertoire 
of enzymes that act on nucleic acids. Restriction enzymes are a key group 
among them. These endonucleases recognize specific base sequences in 
double-helical DNA and cleave both strands of the duplex, forming spe¬ 
cific fragments of DNA. These restriction fragments can be separated 












and displayed by gel electrophoresis. The pattern that they form on the 
gel is a fingerprint of a DNA molecule. A DNA fragment containing a 
particular sequence can be identified by hybridizing it with a labeled 
single-stranded DNA probe (Southern blotting). 

Rapid sequencing techniques have been developed to further the 
analysis of DNA molecules. DNA can be sequenced by controlled inter- 
ruption of replication (Sanger dideoxy method). The fragments produced 
are separated by gel electrophoresis and visualized by autoradiography of 
a labei at the 5' end or by fluorescent tags. The recent sequencing of 
many complete genomes demonstrates the power of these techniques. 

DNA probes for hybridization reactions, as well as new genes, can be 
synthesized by the automated solid-phase method. The technique is to 
add deoxyribonucleoside 3'-phosphoramidites to one another to form a 
growing chain that is linked to an insoluble support. DNA chains a hun- 
dred nucleotides long can be readily synthesized by this automated solid- 
phase method. The polymerase chain reaction makes it possible to greatly 
amplify specific segments of DNA in vitro. The region amplified is de- 
termined by the placement of a pair of primers that are added to the tar- 
get DNA along with a thermostable DNA polymerase and deoxyribonu¬ 
cleoside triphosphates. The exquisite sensitivity of PCR makes it a choice 
technique in detecting pathogens and câncer markers, in genotyping, and 
in reading DNA from fossils that are many thousands of years old. 

• Recombinant DNA Technology Mas Revolutionized All Aspects of Biology 

New genes can be constructed in the laboratory, introduced into host cells, 
and expressed. Novel DNA molecules are made by joining fragments 
that have complementary cohesive ends produced by the action 
of a restriction enzyme. DNA ligase seals breaks in DNA chains. Vec- 
tors for propagating the DNA include plasmids, A phage, and yeast ar¬ 
tificial chromosomes. Specific genes can be cloned from a genomic library 
using a DNA or RNA probe. Foreign DNA can be expressed after inser- 
tion into prokaryotic and eukaryotic cells by the appropriate vector. 

• Manipulating the Genes of Eukaryotes 

The production of giant mice by injecting the gene for rat growth hor- 
mone into fertilized mouse eggs vividly shows that mammalian cells can 
be genetically altered in a designed way. New DNA can be brought into 
plant cells by the soil bacterium Agrobacterium tumefadens, which har- 
bors Ti (tumor-inducing) plasmids. DNA can also be introduced into 
plant cells by applying intense electric fields, which render them tran- 
siently permeable to very large molecules, or by bombarding them with 
DNA-coated microparticles. Gene-expression leveis can be examined 
through the hybridization of cellular mRNA to arrays of oligonu- 
cleotides synthesized on solid supports (gene chips). The functions of 
particular genes can also be investigated by disruption. 

• Novel Proteins Can Be Engineered by Site-Specific Mutagenesis 

Specific mutations can be generated in vitro to engineer novel proteins. 
A mutant protein with a single amino acid substitution can be produced 
by priming DNA replication with an oligonucleotide encoding the new 
amino acid. Plasmids can be engineered to permit the facile insertion of 
a DNA cassette containing any desired mutation. The techniques of 
protein and nucleic acid chemistry are highly synergistic. Investigators 
now move back and forth between gene and protein with great facility. 
Recombinant DNA technology is beginning to significantly alter the 
practice of medicine by providing new diagnostic and therapeutic agents 
and revealing molecular mechanisms of disease. 
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-1 KEY TERMS |- 


restriction enzyme (p. 144) 
palindrome (p. 144) 

DNA probe (p. 146) 

Southern blotting (p.l46) 

Northern blotting (p. 146) 
controlled termination of replication 
(Sanger dideoxy method) (p. 146) 
polymerase chain reaction (PCR) 
(p. 149) 
vector (p. 152) 
plasmid (p. 152) 


sticky ends (p. 152) 

DNA ligase (p. 152) 
lambda (À) phage (p. 153) 
genomic library (p. 155) 
bacterial artificial chromosome (BAC) 
(p. 157) 

yeast artificial chromosome (YAC) 

(p. 157) 

chromosome walking (p. 157) 
complementary DNA (cDNA) (p. 158) 
reverse transcriptase (p. 158) 


cDNA library (p. 158) 
expression vector (p. 159) 

DNA microarray (gene chip) (p. 159) 
transgenic mice (p. 161) 
gene disruption (gene knockout) (p. 162) 
tumor-inducing plasmid (Ti plasmid) 

(p. 163) 

oligonucleotide - directed mutagenesis 

(p. 165) 

cassette mutagenesis (p. 165) 
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-1 PROBLEMS I- 

1. Reading sequences. An autoradi- 
ogram of a sequencing gel containing 
four lanes of DNA fragments is 
shown in the adjoining illustration. 

(a) What is the sequence of the DNA 
fragment? (b) Suppose that the 
Sanger dideoxy method shows that 
the template strand sequence is 
5'-TGCAATGGC-3'. Sketch the 
gel pattern that would lead to this 
conclusion. 

2. The right template. Ovalbumin is the major protein of egg 
white. The chicken ovalbumin gene contains eight exons sepa- 
rated by seven introns. Should one use ovalbumin cDNA or 
ovalbumin genomic DNA to form the protein in E. coli? Why? 

3. Cleavage frequency. The restriction enzyme Alui cleaves at the 
sequence 5'-AGCT-3', and Notl cleaves at 5'-GCGGCCGC- 
3'. What would be the average distance between cleavage sites 
for each enzyme on digestion of double-stranded DNA? 

4. The right cuts. Suppose that a human genomic library is pre- 
pared by exhaustive digestion of human DNA with the EcoKl 
restriction enzyme. Fragments averaging about 4 kb in length 
would be generated. 

(a) Is this procedure suitable for cloning large genes? Why? 

(b) Is this procedure suitable for mapping extensive stretches of 
the genome by chromosome walking? Why? 

5. A revealing cleavage. Sickle-cell anemia arises from a muta¬ 
tion in the gene for the (3 chain of human hemoglobin. The 
change from GAG to GTG in the mutant eliminates a cleavage 
site for the restriction enzyme MstII, which recognizes the tar- 
get sequence CCTGAGG. These findings form the basis of a 
diagnostic test for the sickle-cell gene. Propose a rapid procedure 
for distinguishing between the normal and the mutant gene. 
Would a positive result prove that the mutant contains GTG in 
place of GAG? 

6. Many melodies from one cassette. Suppose that you have iso- 
lated an enzyme that digests paper pulp and have obtained its 
cDNA. The goal is to produce a mutant that is effective at high 
temperature. You have engineered a pair of unique restriction sites 


in the cDNA that flank a 30-bp coding region. Propose a rapid 
technique for generating many different mutations in this region. 

7. A blessing and a curse. The power of PCR can also create prob¬ 
lems. Suppose someone claims to have isolated dinosaur DNA 
by using PCR. What questions might you ask to determine if it 
is indeed dinosaur DNA? 

8. Questions of accuracy. The stringency (Section 6.1.5) of PCR 
amplification can be controlled by altering the temperature at 
which the hybridization of the primers to the target DNA oceurs. 
How would altering the temperature of hybridization effect the 
amplification? Suppose that you have a particular yeast gene A, 
and you wish to see if it has a counterpart in humans. How would 
controlling the stringency of the hybridization help you? 

9. Terra incógnita. PCR is typically used to amplify DNA that 
lies between two known sequences. Suppose that you want to ex¬ 
plore DNA on both sides of a single known sequence. Devise a 
variation of the usual PCR protocol that would enable you to 
amplify entirely new genomic terrain. 

10. A puzzling ladder. A gel pattern displaying PCR produets 
shows four strong bands. The four pieces of DNA have lengths 
that are approximately in the ratio of 1:2:3:4. The largest band 
is cut out of the gel, and PCR is repeated with the same primers. 
Again, a ladder of four bands is evident in the gel. What does 
this result reveal about the strueture of the encoded protein? 

11. Landmarks in the genome. Many laboratories throughout the 
world are mapping the human genome. It is essential that the re- 
sults be merged at an early stage to provide a working physical 
map of each chromosome. In particular, it is necessary to know 
whether a YAC studied in one laboratory overlaps a YAC stud- 
ied in another when only a small proportion of each (less than 
5%) has been sequenced. Propose a simple test for overlap based 
on the transfer of information but not of materiais between the 
two laboratories. 

Chapter Integration Problem 

12. Designing primers. A successful PCR experiment often de- 
pends on designing the correct primers. In particular, the for 
each primer should be approximately the same. What is the ba¬ 
sis of this requirement? 


Termination 
A G C T 
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Chapter Integration and Data Analysis Problem 

13. Any direction but east. A series of people are found to have 
difficulty eliminating certain types of drugs from their blood- 
streams. The problem has been linked to a gene X, which en- 
codes an enzyme Y. Six people were tested with the use of vari- 
ous techniques of molecular biology. Person A is a normal 
control, person B is asymptomatic but some of his children have 
the metabolic problem, and persons C through F display the 
trait. Tissue samples from each person were obtained. Southern 
analysis was performed on the DNA after digestion with the re- 
striction enzyme HindIII. Northern analysis of mRNA also was 
done. In both types of analysis, the gels were probed with la- 
beled X cDNA. Finally, a Western blot with an enzyme-linked 
monoclonal antibody was used to test for the presence of protein 
Y. The results are shown here. Why is person B without symp- 
toms? Suggest possible defects in the other people. 


A B C D E F 



Data Interpretation Problem 

14. DNA diagnostics. Representations of sequencing gels for 
variants of the a chain of human hemoglobin are shown here. 
What is the nature of the amino acid change in each of the vari¬ 
ants? The first triplet encodes valine. 


HEMOGLOBIN TYPE 


Normal 


Chongqing 


Karachi 


Swan River 


G A T C 


G A T C 


G A T C 


G A T C 





Western blots 


























































Exploring Evolution 



Evolutionary relationships are manifest ín 
proteín sequences. The dose kinship between 
human beings and chimpanzees, hinted at by 
the mutual interest shown by Jane Goodall and a 
chimpanzee in the photograph, is revealed in the 
amino add sequences of myoglobin. The human 
sequence (red) differs from the chimpanzee 
sequence (blue) in only one amino acid in a 
protein chain of 153 residues. [(Left) Kennan 
Ward/Corbis.] 


GLSDGEWQLVLNVWGKVEADI PGHGQEVL I RLFKGHPET L EKFDKFKHLKS EDEMKASEDLKKHGATVLTALGG I L- 
GLSDGEWQLVLNVWGKVEADI PGHGQEVL I R L FKGHP ET L EKFDKFKH LKS EDEMKASEDLKKHGATVLTALGG I L- 

KKKGHHEAEIKPLAQSHATKHK I PVKYLEFI SEC I I QVLKSKHPGDFGADAQGAMNKALELFRKDMASNYKE LGFQG 
KKKGHHEAEIKPLAQSHATKHK I PVKYLEFI SEC I I QV l||S KHPGDFGADAQGAMNKALELFRKDMASNYKE LGFQG 


OUTLINE 


Like members of a human family, members of molecular families often have 
features in common. Such family resemblance is most easily detected by 
comparing three-dimensional structure, the aspect of a molecule most 
closely linked to function. Consider, for example, ribonuclease from cows, 
which was introduced in our consideration of protein folding (Section 3.6). 
Comparing structures reveals that the three-dimensional structure of this 
protein and that of a human ribonuclease are quite similar 
(Figure 7.1). Although this similarity is not unexpected, 
given the similarity in biological function, similarities re¬ 
vealed by comparisons are sometimes surprising. For ex¬ 
ample, angiogenin, a protein identified on the basis of its 
ability to stimulate the growth of new blood vessels, also 
turns out to be structurally similar to ribonuclease—so sim¬ 
ilar that it is clear that both angiogenin and ribonuclease are 
members of the same protein family (Figure 7.2). Angio¬ 
genin and ribonuclease must have had a common ancestor 
at some earlier stage of evolution. 

Unfortunately, three-dimensional structures have been 
determined for only a relatively small number of proteins. 

In contrast, gene sequences and the corresponding amino 
acid sequences are available for a great number of proteins, 
largely owing to the tremendous power of DNA cloning and 
sequencing techniques. Evolutionary relationships also are 
manifest in amino acid sequences. For example, comparison 
of the amino acid sequences of bovine ribonuclease and an¬ 
giogenin reveals that 35% of amino acids in corresponding 
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FIGURE 7.1 Structures of 
ríbonucleases from cows and human 
beings. Structural similarity often follows 
functional similarity. 



Angiogenin 




FIGURE 7.2 Structure of 
angiogenin. The protein angiogenin, 
identified on the basis of its ability to 
stimulate blood-vessel growth, is highly 
similar to ribonuclease. 


FIGURE 7.3 Two classes of homologs. 

Homologs that perform identical or very 
similar functions in different organisms are 
called orthologs, whereas homologs that 
perform different functions within one 
organism are called paralogs. 




Bovine ribonuclease 


Human ribonuclease 


positions are identical. Is this levei sufficiently high to ensure an evolu- 
tionary relationship? If not, what levei is required? In this chapter, we shall 
examine the methods that are used to compare amino acid sequences and 
to deduce such evolutionary relationships. 

Sequence-comparison methods have become a powerful tool in modem bio- 
chemistry. Sequence databases can be probed for matches to a newly elucidated 
sequence in order to identify related molecules. This information can often be 
a source of considerable insight into the function and mechanism of the newly 
sequenced molecule. When three-dimensional structures are available, they 
may be compared to confirm relationships suggested by sequence comparisons 
and to reveal others that are not readily detected at the levei of sequence alone. 

By examining the footprints present in modern protein sequences, the 
biochemist can become a molecular archeologist able to learn about events 
in the evolutionary past. Sequences comparisons can often reveal both 
pathways of evolutionary descent and estimated dates of specific evolu¬ 
tionary landmarks. This information can be used to construct evolutionary 
trees that trace the evolution of a particular protein or nucleic acid in many 
cases from Archaea and Bactéria through Eukarya, including human be¬ 
ings. Molecular evolution can also be studied experimentally. In some cases, 
DNA from fossils can be amplified by PCR methods (Section 6.1.5) and 
sequenced, giving a direct view into the past. In addition, investigators can 
observe molecular evolution taking place in the laboratory, through exper- 
iments based on nucleic acid replication. The results of such studies are re- 
vealing more about how evolution proceeds. 


COW 


Bovine ribonuclease 

(digestive enzyme) 




Human ribonuclease Angiogenin 

(digestive enzyme) (stimulates blood-vessel growth) 






7.1 HOMOLOGS ARE DESCENDED FROM A COMIVION 
ANCESTOR 


Analysis of Sequence Alignments 


The exploration of biochemical evolution consists largely of an attempt to 
determine how proteins, other molecules, and biochemical pathways have 
been transformed through time. The most fundamental relationship be- 
tween two entities is homology; two molecules are said to be homologous if 
they have been derived from a common ancestor. Homologous molecules, 
or homologs, can be divided into two classes (Figure 7.3). Paralogs are ho- 
mologs that are present within one species. Paralogs often differ in their de- 
tailed biochemical functions. Orthologs are homologs that are present within 
different species and have very similar or identical functions. Understand- 
ing the homology between molecules can reveal the evolutionary history of 
the molecules as well as Information about their function; if a newly se- 
quenced protein is homologous to an already characterized protein, we have 
a strong indication of the new protein s biochemical function. 

How can we tell whether two human proteins are paralogs or whether a 
yeast protein is the ortholog of a human protein? As will be discussed in Section 
7.2, homology is often manifested by significant similarity in nucleotide or amino 
acid sequence and almost always manifested in three-dimensional structure. 


7.2 STATISTICAL ANALYSIS OF SEQUENCE ALIGNIVIENTS 
CAN DETECT HOlVIOLOGY 


I interacti 


CONCEPTUAL INSIGHTS, Sequence Analysis, provides opportunities to 
interactively explore issues involved in sequence alignment 

A significant sequence similarity between two molecules implies that they 
are likely to have the same evolutionary origin and, therefore, the same 
three-dimensional structure, function, and mechanism. Although both nu- 
cleic acid and protein sequences can be compared to detect homology, a 
comparison of protein sequences is much more effective for several reasons, 
most notably that proteins are built from 20 different building blocks, 
whereas RN A and DNA are synthesized from only 4 building blocks. 

To illustrate sequence-comparison methods, let us consider a class of 
proteins called the globins. Myoglobin is a protein that binds oxygen in mus- 
cle, whereas hemoglobin is the oxygen-carrying protein in blood (Section 
10.2). Both proteins cr adie a heme group, an iron-containing organic mol- 
ecule that binds the oxygen. Each human hemoglobin molecule is composed 
of four heme-containing polypeptide chains, two identical a chains and two 
identical (3 chains. Here, we consider only the a chain. We wish to exam¬ 
ine the similarity between the amino acid sequence of the human a chain 
and that of human myoglobin (Figure 7.4). To detect such similarity, meth¬ 
ods have been developed for sequence alignment. 




_ CONCEPTUAL INSIGHTS, 

appearing throughout the book, are 
Interactive animations that help you build 
your understanding of key biochemical 
principies and concepts. To access, go to 
the Web site: wvvw.whfreeman.com/ 
biochemS, and select the chapter, 
Conceptual Insights, and the titie. 


Human hemoglobin (a chain) 

VLSPADKTNVKAAWGKVGAHAGEYGAEALERMFLSFPTTKTYFPHFDLSHG 

SAQVKGHGKKVADALTNAVAHVDDMPNALSALSDLHAHKLRVDPVNFKLLS 

HCLLVTLAAHLPAEFTPAVHASLDKFLASVSTVLTSKYR 


Human myoglobin 

GLSDGEWQLVLNVWGKVEADI PGHGQEVL I RLFKGHPETLEKFDKFKHLKS 
EDEMKASEDLKKHGATVLTALGGILKKKGHHEAEIKPLAQSHATKHKIPVK 
YLEFI SECI IQVLQSKHPGDFGADAQGAMNKALELFRKDMASNYKELGFQG 


FIGURE 7.4 Amino acid sequences of 
human hemoglobin (o; chain) and 
human myoglobin. Hemoglobin a is 
composed of 141 amino acids; myoglobin 
consists of 153 amino acids. (One-letter 
abbreviations designating amino acids are 
used; see Table 3.2.) 
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Alignment 


FIGURE 7.5 Comparing the amíno acid sequences of hemoglobin a and 
myoglobin. (A) A comparison is made by sliding the sequences of the two proteins past 
one another, one amino acid at a time, and counting the number of amino acid identities 
between the proteins. (B) The two alignments with the largest number of matches are 
shown above the graph, which plots the matches as a function of alignment. 


How can we tell where to align the two sequences? The simplest ap- 
proach is to compare all possible juxtapositions of one protein sequence with 
another, in each case recording the number of identical residues that are 
aligned with one another. This comparison can be accomplished by simply 
sliding one sequence past the other, one amino acid at a time, and counting 
the number of matched residues (Figure 7.5). 

For hemoglobin a and myoglobin, the best alignment reveals 23 se¬ 
quence identities, spread throughout the central parts of the sequences. 
However, a nearby alignment showing 22 identities is nearly as good. In 
this alignment, the identities are concentrated toward the amino-terminal 
end of the sequences. The sequences can be aligned to capture most of the 
identities in hoth alignments by introducing a gap into one of the sequences 
(Figure 7.6). Such gaps must often be inserted to compensate for the inser- 
tions or deletions of nucleotides that may have taken place in the gene for 
one molecule but not the other in the course of evolution. 
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The use of gaps substantially increases the complexity of sequence 
alignment because, in principie, the insertion of gaps of arbitrary sizes must 
be considered throughout each sequence. However, methods have been de- 
veloped for the insertion of gaps in the automatic alignment of sequences. 
These methods use scoring systems to compare different alignments, and 
they include penalties for gaps to prevent the insertion of an unreasonable 
number of them. Here is an example of such a scoring system: each iden- 
tity between aligned sequences results in +10 points, whereas each gap in- 
troduced, regardless of size, results in — 25 points. For the alignment shown 
in Figure 7.6, there are 38 identities and 1 gap, producing a score of (38 X 
10 — 1 X 25 = 355). Overall, there are 38 matched amino acids in an aver- 
age length of 147 residues; so the sequences are 25.9% identical. The next 
step is to ask. Is this precentage of identity significant? 

7.2.1 The Statistical Significance of Alignments Can Be 
Estimated by Shuffling 

The similarities in sequence in Figure 7.5 appear striking, yet there remains 
the possibility that a grouping of sequence identities has occurred by chance 
alone. How can we estimate the probability that a specific series of identi¬ 
ties is a chance occurrence? To make such an estimate, the amino acid se¬ 
quence in one of the proteins is ‘‘shuffled''—that is, randomly rearranged— 
and the alignment procedure is repeated (Figure 7.7). This process is 
repeated to build up a distribution showing, for each possible score, the 
number of shuffled sequences that received that score. 

When this procedure is applied to the sequences of myoglobin and he¬ 
moglobin o:, the authentic alignment clearly stands out (Figure 7.8). Its score 
is far abo ve the mean for the alignment scores based on shuffled sequences. 
The odds of such a deviation occurring owing due to chance alone are 
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FIGURE 7.6 Alignment with gap 
insertion. The alignment of hemoglobin 
a and myoglobin after a gap has been 
inserted into the hemoglobin a sequence. 


TH ISISTHEAUTHENTICSEQUENCE 
Shuffling 

SNUCSNSEATEEITUHEQIHHTTCEI 

FIGURE 7.7 The generation of a 
shuffled sequence. 


FIGURE 7.8 Statistical comparison of 
alignment scores. Alignment scores are 
calculated for many shuffled sequences, 
and the number of sequences generating 
a particular score is plotted against the 
score. The resulting plot is a distribution of 
alignment scores occurring by chance. The 
alignment score for hemoglobin a and 
myoglobin (shown in red) is substantially 
greater than any of these scores, strongly 
suggesting that the sequence similarity is 
significant. 
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approximately 1 in 10^^. Thus, we can comfortably conclude that the two 
sequences are genuinely similar; the simplest explanation for this similarity 
is that these sequences are homologous—that is, that the two molecules have 
descended by divergence from a common ancestor. 



FIGURE 7.9 A graphic view of the Blosum-62 substítutíon matríx. This scoring scheme 
was derived by examining substitutions that occur within aligned sequence blocks in related 
proteins. Amino adds are classified into four groups (charged, red; polar, green; large and 
hydrophobic, blue; other, black). Substitutions that require the change of only a single 
nucleotide are shaded. To find the score for a substitution of, for instance, a Y for an H, you 
find the Y in the column having H (boxed) at the top and check the number at the left. In this 
case, the resulting score is 3. 






















































































7.2.2 Distant Evolutionary Relationships Can Be Detected 
Through the Use of Substitution Matrices 

The scoring scheme in Section 7.2.1 assigns points only to positions occu- 
pied by identical amino acids in the two sequences being compared. No 
credit is given for any pairing that is not an identity. However, not all sub- 
stitutions are equivalent. Some are structurally conservative substitutions, re- 
placing one amino acid with another that is similar in size and chemical 
properties. Such conservative amino acid substitutions may have relatively 
minor effects on protein structure and can thus be tolerated without com- 
promising function. In other substitutions, an amino acid replaces one that 
is dissimilar. Furthermore, some amino acid substitutions result from the 
replacement of only a single nucleotide in the gene sequence; whereas oth- 
ers require two or three replacements. Conservative and single-nucleotide 
substitutions are likely to be more common than are substitutions with more 
radical effects. How can we account for the type of substitution when 
comparing sequences? We can approach this problem by first examining 
the substitutions that have actually taken place in evolutionarily related 
proteins. 

From the examination of appropriately aligned sequences, substitution 
matrices can be deduced. In these matrices, a large positive score corre- 
sponds to a substitution that occurs relatively frequently, whereas a large 
negative score corresponds to a substitution that occurs only rarely. The 
Blosum-62 substitution matrix illustrated in Figure 7.9 is an example. The 
highest scores in this substitution matrix indicate that amino acids such as 
cysteine (C) and tryptophan (W) tend to be conserved more than those 
such as serine (S) and alanine (A). Furthermore, structurally conservative 
substitutions such as lysine (K) for arginine (R) and isoleucine (I) for va- 
line (V) have relatively high scores. When two sequences are compared, 
each substitution is assigned a score based on the matrix. In addition, a gap 
penalty is often assigned according to the size of the gap. For example, the 
introduction of a gap lowers the alignment score by 12 points and the ex- 
tension of an existing gap costs 2 points per residue. Using this scoring Sys¬ 
tem, the alignment shown in Figure 7.6 receives a score of 115. In many 
regions, most substitutions are conservative (defined as those substitutions 
with scores greater than 0) and relatively few are strongly disfavored types 
(Figure 7.10). 

This scoring system detects homology between less obviously related 
sequences with greater sensitivity than would a comparison of identities 
only. Consider, for example, the protein leghemoglobin, an oxygen-binding 
protein found in the roots of some plants. The amino acid sequence of leghe¬ 
moglobin from the herb lupine can be aligned with that of human myoglo- 
bin and scored by using either the simple scoring scheme based on identi¬ 
ties only or the Blosum-62 scoring matrix (see Figure 7.9). Repeated 
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FIGURE 7.10 Alignment with 
conservative substitutions noted. The 

alignment of hemoglobin a and 
myoglobin with consen/ative substitutions 
indicated by yellow shading and identities 
by orange. 
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FIGURE 7.11 Alígnment of ídentítíes 
oniy versus the Blosum 62 matríx. 

Repeated shuffling and scoring reveal 
the significance of sequence alignment 
for human myoglobin versus lupine 
leghemoglobin with the use of either 
(A) the simple, identity-based scoring 
System or (B) the Blosum-62 matrix. The 
scores for the alignment of the authentic 
sequences are shown in red. The Blosum 
matrix provides greater statistical power. 




(B) Alignment score (Blosum 62) 


shuffling and scoring provides a distribution of alignment scores (Figure 
7.11). Scoring based on identities only indicates that the odds of the align¬ 
ment between myoglobin and leghemoglobin occurring by chance alone are 
1 in 20. Thus, although the levei of similarity suggests a relationship, there 
is a 5% chance that the similarity is accidental on the basis of this analysis. 
In contrast, users of the substitution matrix are able to incorporate the ef- 
fects of conservative substitutions. From such an analysis, the odds of the 
alignment occurring by chance are calculated to be approximately 1 in 300. 
Thus, an analysis performed by using the substitution matrix reaches a 
much firmer conclusion about the evolutionary relationship between these 
proteins (Figure 7.12). 
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FIGURE 7.12 Alígnment of human myoglobin and lupine leghemoglobin. The use of 

the Blosum-62 substitution matrix yields the alignment shown between human myoglobin 
and lupine leghemoglobin, illustrating identities (orange) and conservative substitutions 
(yellow). These sequences are 23% identical. 


Experience with sequence analysis has led to the development of simpler 
rules of thumb. For sequences longer than 100 amino acids, sequence iden¬ 
tities greater than 25% are almost certainly not the result of chance alone; 
such sequences are probably homologous. In contrast, if two sequences are 
less than 15% identical, pairwise comparison alone is unlikely to indicate sta- 
tistically significant similarity. For sequences that are between 15% and 2 5% 
identical, further analysis is necessary to determine the statistical significance 
of the alignment. It must be emphasized that the lack of a statistically sig¬ 
nificant degree of sequence similarity does not rule out homology. The sequences 
of many proteins that have descended from common ancestors have diverged 
to such an extent that the relationship between the proteins can no longer be 
detected from their sequences alone. As we will see, such homologous pro¬ 
teins can often be detected by examining three- dimensional structures. 

7.2.3 Databases Can Be Searched to Identify 
Homologous Sequences 

When the sequence of a protein is first determined, comparing it with all 
previously characterized sequences can be a source of tremendous insight 
into its evolutionary relatives and, hence, its structure and function. Indeed, 
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on a newly elucidated sequence. The sequence alignment methods heretofore Examination of Three-Dimensional Structure 

described are used to compare an individual sequence with all members of 
a database of known sequences. 

In 1995, investigators reported the first complete sequence of the genome 
of a free-living organism, the bacterium Haemophilus influenzae. Of 1743 
identified open reading frames (Section 6.3.2), 1007 (58%) could be linked by 
sequence-comparison methods to some protein of known function that had 
been previously characterized in another organism. An additional 347 open 
reading frames could be linked to sequences in the database for which no func¬ 
tion had yet been assigned ('Typothetical proteins’'). The remaining 389 se¬ 
quences did not match any sequence present in the database at the time at 
which the Haemophilus influenzae sequence was completed. Thus, investiga¬ 
tors were able to identify likely functions for more than half the proteins within 
this organism solely through the use of sequence-comparison methods. 


7.3 EXAMINATION OF THREE-DIMENSIONAL 
STRUCTURE ENHANCES OUR UNDERSTANDING 
OF EVOLUTIONARY RELATIONSHIPS 


Sequence comparison is a powerful tool for extending our knowledge of pro¬ 
tein function and kinship. However, biomolecules generally function as in- 
tricate three-dimensional structures rather than as linear polymers. Muta- 
tions occur at the levei of sequence, but the effects of the mutations are at 
the levei of function, and function is directly related to tertiary structure. 
Consequently, to gain a deeper understanding of evolutionary relationships 
between proteins, we must examine three-dimensional structures, especially 
in conjunction with sequence information. The techniques of structural de- 
termination are presented in Chapter 4. 

7.3.1 Tertiary Structure Is More Conserved Than 
Primary Structure 

Because three-dimensional structure is much more closely associated with 
function than is sequence, tertiary structure is more evolutionarily con¬ 
served than is primary structure. This conservation is apparent in the ter¬ 
tiary structures of the globins (Figure 7.13), which are extremely similar 
even though the similarity between human myoglobin and lupine leghe- 
moglobin is just barely detectable at the sequence levei and that between 
human hemoglobin {a chain) and lupine leghemoglobin is not statistically 


% FIGURE 7.13 Conservation of 
three-dimensíonal structure. The 

tertiary structures of human hemoglobin 
{a chain), human myoglobin, and lupine 
leghemoglobin are conserved. Each heme 
group contains an iron atom to which 
oxygen binds. 
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FIGURE 7.14 Structures of actín 
and the large fragment of heat shock 
protein 70 (Hsp-70). A comparison of the 
identically colored elements of secondary 
structure reveals the overall similarity in 
structure despite the difference in 
biochemical activities. 



Actin 


Hsp-70 


significant (15.6% identity). This structural similarity firmly establishes 
that the framework that binds the heme group and facilitates the reversible 
binding of oxygen has been conserved over a long evolutionary period. 

Anyone aware of the similar biochemical functions of hemoglobin, myo- 
globin, and leghemoglobin could expect the structural similarities. In a grow- 
ing number of other cases, however, a comparison of three-dimensional struc¬ 
tures has revealed striking similarities between proteins that were not 
expected to be related. A case in point is the protein actin, a major compo- 
nent of the cytoskeleton, and heat shock protein 70 (Hsp-70), which assists 
protein folding inside cells. These two proteins were found to be noticeably 
similar in structure despite only 15.6% sequence identity (Figure 7.14). On 
the basis of their three-dimensional structures, actin and Hsp-70 are paralogs. 
The levei of structural similarity strongly suggests that, despite their differ- 
ent biological roles in modern organisms, these proteins descended from a 
common ancestor. As the three-dimensional structures of more proteins are 
determined, such unexpected kinships are being discovered with increasing 
frequency. The search for such kinships relies ever more frequently on com- 
puter-based search procedures that allow the three-dimensional structure of 
any protein to be compared with all other known structures. 

7.3.2 Knowledge of Three-Dimensional Structures Can Aid 
in the Evaluation of Sequence Alignments 

The sequence-comparison methods described thus far treat all positions 
within a sequence equally. However, examination of families of homologous 
proteins for which at least one three-dimensional structure is known has re¬ 
vealed that regions and residues criticai to protein function are more strongly 
conserved than are other residues. For example, each type of globin con- 
tains a bound heme group with an iron atom at its center. A histidine residue 
that interacts directly with this iron (residue 64 in human myoglobin) is con¬ 
served in all globins. After we have identified key residues or highly con¬ 
served sequences within a family of proteins, we can sometimes identify 
other family members even when the overall levei of sequence similarity is 
below statistical significance. Thus, the generation of sequence templates — 
conserved residues that are structurally and functionally important and are 
characteristic of particular families of proteins—can be useful for recogniz- 
ing new family members that might be undetectable by other means. A va- 
riety of other methods for sequence classification that take advantage of 
known three-dimensional structures also are being developed. Still other 





methods are able to identify relatively conserved residues 
within a family of homologous proteins, even without a 
known three-dimensional structure. These methods are 
proving to be powerful in identifying distant evolutionary 
relationships. 

7.3.3 Repeated Motifs Can Be Detected 
by Aligning Sequences with Themselves 

More than 10% of all proteins contain sets of two or more 
domains that are similar to one another. The afore- 
described sequence search methods can often detect inter- 
nally repeated sequences that have been characterized in 
other proteins. Where repeated units do not correspond to 
previously identified domains, their presence can be de¬ 
tected by attempting to align a given sequence with itself. 

This alignment is most easily visualized with the use of a 
self-diagonal plot. Here, the protein sequence is displayed 
on both the vertical and the horizontal axes, running from 
amino to carboxyl terminus; a dot is placed at each point 
in the space defined by the axes at which the amino acid 
directly below along the horizontal axis is the same as that 
directly across along the vertical axis. The central diagonal 
represents the sequence aligned with itself. Internai repeats 
are manifested as lines of dots parallel to the central diag¬ 
onal, illustrated by the plot in Figure 7.15 prepared for the 
TATA-box-binding protein, a key protein in the initiation 
of gene transcription (Section 28.x.x). 

The statistical significance of such repeats can be tested by aligning the 
regions in question as if these regions were sequences from separate pro¬ 
teins. For the TATA-box-binding protein, the alignment is highly signifi- 
cant: 30% of the amino acids are identical over 90 residues (Figure 7.16A). 
The estimated probability of such an alignment occurring by chance is 1 in 
10^^. The determination of the three-dimensional structure of the TATA- 
box-binding protein confirmed the presence of repeated structures; the pro¬ 
tein is formed of two nearly identical domains (Figure 7.16B). The evidence 
is convincing that the gene encoding this protein evolved by duplication of 
a gene encoding a single domain. 



FIGURE 7.15 A self-diagonal plot for the TATA-box-binding 
protein from the plant Arabídopsís. Self-diagonal plots are 
used to search for amino acid sequence repeats within a protein. 
The central diagonal is the sequence aligned with itself. Red dots 
indicating a correspondence of amino acids appear where two or 
more amino acids in a row match. Lines of dots, highlighted in 
pink, parallel to the central diagonal suggest an internai repeat. 
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FIGURE 7.16 Sequence alignment 
of internai repeats. (A) An alignment 
of the sequences of the two repeats 
of the TATA-box-binding protein. The 
amino-terminal repeat is shown in 
green and the carboxyl-terminal repeat 
in blue. (B) Structure of the TATA-box- 
binding protein. The amino-terminal 
domain is shown in green and the 
carboxyl-terminal domain in blue. 






































FIGURE 7.17 Convergent evolution 
of protease active sites. The relative 
positions of the three key residues shown 
are nearly identical in the active sites of 
the serine proteases chymotrypsin and 
subtilisin. 


7.3.4 Convergent Evolution: Common Solutions 
to Biochemical Challenges 

Thus far, we have been exploring proteins derived from common ancestors— 
that is, through divergent evolution. In other cases, clear examples have been 
found of proteins that are structurally similar in important ways but are not 
descended from a common ancestor. How might two unrelated proteins come 
to resemble each other structurally? Two proteins evolving independently may 
have converged on a similar structure in order to perform a similar biochemi¬ 
cal activity. Perhaps that structure was an especially effective solution to a bio¬ 
chemical problem that organisms face. The process by which very different 
evolutionary pathways lead to the same solution is called convergent evolution. 

One example of convergent evolution is found among the serine pro¬ 
teases. These enzymes, to be discussed in more detail in Chapter 9, cleave 
peptide bonds by hydrolysis. Figure 7.17 shows for two such enzymes the 
structure of the active sites —that is, the sites on the proteins at which the 
hydrolysis reaction takes place. These active-site structures are remar kably 
similar. In each case, a serine residue, a histidine residue, and an aspartic 
acid residue are positioned in space in nearly identical arrangements. As we 
will see, this is the case because chymotrypsin and subtilisin use the same 
mechanistic solution to the problem of peptide hydrolysis. At first glance, 
this similarity might suggest that these proteins are homologous. However, 
striking differences in the overall structures of these proteins make an evo¬ 
lutionary relationship extremely unlikely (Figure 7.18). Whereas chy¬ 
motrypsin consists almost entirely of j8 sheets, subtilisin contains extensive 
n-helical structure. Moreover, the key serine, histidine, and aspartic acid 
residues do not occupy similar positions or even appear in the same order 
within the two sequences. It is extremely unlikely that two proteins evolv¬ 
ing from a common ancestor could have retained similar active-site struc¬ 
tures while other aspects of the structure changed so dramatically. 


FIGURE 7.18 Structures of chymotrypsin 
and subtilisin. The j8 strands are shown 
in yellow and a helices in blue. The overall 
structures are quite dissimilar, in stark 
contrast with the active sites, shown at 
the top of each structure. 



Chymotrypsin 



Subtilisin 


7.3.5 Comparison of RNA Sequences Can Be a Source 
of Insight into Secondary Structures 

A comparison of homologous RNA sequences can be a source of important 
insights into evolutionary relationships in a manner similar to that already 
described. In addition, such comparisons provide clues to the three- 
dimensional structure of the RNA itself. As noted in Chapter 5, single- 
stranded nucleic acid molecules fold back on themselves to form elaborate 
structures held together by Watson-Crick base-pairing and other interac- 
tions. In a family of sequences that form such base-paired structures, base 
sequences may vary, but base-pairing ability is conserved. Consider, for 
example, a region from a large RNA molecule present in the ribosomes of 




all organisms (Figures 7.19). In the region shown, the E. coli sequence has a 
guanine (G) residue in position 9 and a cytosine (C) residue in position 22, 
whereas the human sequence has uracil (U) in position 9 and adenine (A) in 
position 22. Examination of the six sequences shown in Figure 7.20 (and 
many others) reveals that the bases in positions 9 and 22 retain the ability to 
form a Watson-Crick base pair even though the identities of the bases in these 
positions vary. Base-pairing ability is also conserved in neighboring posi¬ 
tions; we can deduce that two segments with such compensating mutations 
are likely to form a double helix. Where sequences are known for several 
homologous RN A molecules, this type of sequence analysis can often sug- 
gest complete secondary structures as well as some additional interactions. 


(A) 

Escherichia coli 
BACTÉRIA , 

Pseudomonas aeruginosa 


{9 |22 

CACACGGCGGGUGCUAACGUCCGUCGUGAA 

ACCACGGCGGGUGCUAACGUCCGUCGUGAA 


ARCHAEA 


Halobacterium halobium 
Methanococcus vannielli 


CCGGUGUGCGGGG-UAAGCCUGUGCACCGU 

GAGGGCAUACGGG-UAAGCUGUAUGUCCGA 


Homosapiens GGGCCACUUUUGG-UAAGCAGAACUGGCGC 

Saccharomyces cerevisiae GGGCCAUUUUUGG-UAAGCAGAACUGGCGA 

FIGURE 7.19 Comparíson of RNA sequences. (A) A comparison of sequences in a part 
of ribosomal RNA taken from a variety of species. (B) The implied secondary structure. 

Bars indicate positions at which Watson-Crick base-pairing is completely conserved in the 
sequences shown, whereas dots indicate positions at which Watson-Crick base-pairing is 
consen/ed in most cases. 
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7.4 EVOLUTIONARY TREES CAN BE CONSTRUCTED 
ON THE BASIS OF SEQUENCE INFORMATION 


CONCEPTUAL INSICHTS, Sequence Analysis, offers insights into evolutionary 
I trees through Interactive analysis of simulated evoluntionary histories. 

The observation that homology is often manifested as sequence similarity 
suggests that the evolutionary pathway relating the members of a family of 
proteins may be deduced by examination of sequence similarity. This ap- 
proach is based on the notion that sequences that are more similar to one 
another have had less evolutionary time to diverge from 
one another than have sequences that are less similar. 

This method can be illustrated by using the three globin 
sequences in Figures 7.10 and 7.12, as well as the se¬ 
quence for the human hemoglobin j8 chain. These se¬ 
quences can be aligned with the additional constraint that 
gaps, if present, should be at the same positions in all of 
the proteins. These aligned sequences can be used to con- 
struct an evolutionary tree in which the length of the 
branch connecting each pair of proteins is proportional 
to the number of amino acid differences between the se¬ 
quences (Figure 7.20). 

Such comparisons reveal only the relative divergence 
times—for example, that myoglobin diverged from hemo¬ 
globin twice as long ago as the a chain diverged from the 
j8 chain. How can we estimate the approximate dates of gene 
duplications and other evolutionary events? Evolutionary 
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FIGURE 7.20 An evolutionary tree for globins. The branching 
structure was deduced by sequence comparison, whereas the 
resuits of fóssil studies provided the overall time scale showing 
when divergence occurred. 







FIGURE 7.21 The lamprey. A jawless 
fish whose ancestors diverged from bony 
fish approximately 400 million years 
ago, the lamprey contains hemoglobin 
molecules that contain oniy a single type 
of polypeptide chain. [Brent P. Kent] 


Homo sapiens Chimpanzee 



FIGURE 7.22 Placíng Neanderthal on 
an evolutíonary tree. Comparison of 
DNA sequences revealed that Neanderthal 
is not on the line of direct descent leading 
to Homo sapiens but, instead, branched 
off earlier and then became extinct. 


trees can be calibrated by comparing the deduced branch points with diver- 
gence times determined from the fóssil record. For example, the duplication 
leading to the two chains of hemoglobin appears to have occurred 350 million 
years ago. This estimate is supported by the observation that jawless fish such 
as the lamprey, which diverged from bony fish approximately 400 million 
years ago, contain hemoglobins built from a single type of subunit (Figure 7.21). 

These methods can be applied to both relatively modern and very an- 
cient molecules, such as the ribosomal RN As that are found in all organ- 
isms. Indeed, it was such an RN A sequence analysis that led to the sug- 
gestion that Archaea are a distinct group of organisms that diverged from 
Bactéria very early in evolutionary history. 


7.5 MODERN TECHNIQUES MAKE THE EXPERIMENTAL 
EXPLORATION OF EVOLUTION POSSIBLE 

Two techniques of biochemistry have made it possible to examine the course 
of evolution more directly and not simply by inference. The polymerase 
chain reaction (Section 6.1.5) allows the direct examination of ancient DNA 
sequences, releasing us, at least in some cases, from the constraints of be- 
ing able to examine existing genomes from living organisms only. Molecu¬ 
lar evolution may be investigated through the use of combinatorial chemistry, 
the process of producing large populations of molecules en masse and se- 
lecting for a biochemical property. This exciting process provides a glimpse 
into the types of molecules that may have existed in the RN A world. 

7.5.1 Ancient DNA Can Sometimes Be Amplified and Sequenced 

The tremendous chemical stability of DNA (Section 2.2.7) makes the mol- 
ecule well suited to its role as the storage site of genetic information. So sta- 
ble is the molecule that samples of DNA have survived for many thousands 
of years under appropriate conditions. With the development of PCR meth¬ 
ods, such ancient DNA can sometimes be amplified and sequenced. This 
approach has been applied to mitochondrial DNA from a Neanderthal fós¬ 
sil estimated at between 30,000 and 100,000 years of age found near Düs- 
seldorf, Germany, in 1856. Investigators managed to identify a total of 379 
bases of sequence. Comparison with a number of the corresponding se¬ 
quences from Homo sapiens revealed between 22 and 36 substitutions, con- 
siderably fewer than the average of 55 differences between human beings 
and chimpanzees over the common bases in this region. Further analysis 
suggested that the common ancestor of modern human beings and Nean- 
derthals lived approximately 600 million years ago. An evolutionary tree 
constructed by using these and other data revealed that the Neanderthal was 
not an intermediate between chimpanzees and human beings but, instead, 
was an evolutionary ‘‘dead end'' that became extinct (Figure 7.22). 

Note that earlier studies describing the sequencing of much more an¬ 
cient DNA such as that found in insects trapped in amber appear to have 
been flawed; contaminating modern DNA was responsible for the se¬ 
quences determined. Successful sequencing of ancient DNA requires suf- 
ficient DNA for reliable amplification and the rigorous exclusion of all 
sources of contamination. 

7.5.2 Molecular Evolution Can Be Examined Experimentally 

Evolution requires three processes: (1) the generation of a diverse popula- 
tion, (2) the selection of members based on some criterion of fitness, and 
(3) reproduction to enrich the population in more fit members (Section 2.2). 




Nucleic acid molecules are capable of undergoing all three processes in vitro I 185 | 

under appropriate conditions. The results of such studies enable us to glimpse Experimental Exploration of Evolution 

how evolutionary processes might have generated catalytic activities and spe- 
cific binding abilities—important biochemical functions in all living systems. 

A diverse population of nucleic acid molecules can be synthesized in the 
laboratory by the process of combinatorial chemistry, which rapidly pro- 
duces large populations of a particular type of molecule such as a nucleic 
acid. A population of molecules of a given size can be generated randomly 
so that many or all possible sequences are present in the mixture. When an 
initial population has been generated, it is subjected to a selection process 
that isolates specific molecules with desired binding or reactivity proper- 
ties. Finally, molecules that have survived the selection process are allowed 
to reproduce through the use of PCR; primers are directed toward specific 
sequences included at the ends of each member of the population. 

As an example of this approach, consider an experiment that set a goal 
of creating an RNA molecule capable of binding adenosine triphosphate 
and related nucleotides. Such ATP-bonding molecules are of interest be- 
cause they might have been present in the RNA world. An initial popula¬ 
tion of RNA molecules 169 nucleotides long was created; 120 of the posi- 
tions differed randomly, with equimolar mixtures of adenine, cytosine, 
guanine, and uracil. The initial synthetic pool that was used contained ap- 
proximately 10^"^ RNA molecules. Note that this number is a very small 
fraction of the total possible pool of random 120-base sequences. From this 
pool, those molecules that bound to ATP, which had been immobilized on 
a column, were selected (Figure 7.23). 


Randomized RNA pool 

r «V RNA pool 
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FIGURE 7.23 Evolution in the laboratory. 

A collection of RNA molecules of random 
sequences is synthesized by combinatorial 
chemistry. This collection is selected for 
the ability to bind ATP by passing the RNA 
through an ATP affinity column (Section 
4.1.3). The ATP-binding RNA molecules are 
released from the column by washing with 
excess ATP, and replicated. The process of 
selection and replication is then repeated 
several times. The final RNA products with 
significant ATP-binding ability are isolated 
and characterized. 
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FIGURE 7.24 A conserved secondary 
structure. The secondary structure shown 
is common to RNA molecules selected for 
ATP binding. 


The collection of molecules that were bound well by the ATP affinity 
column was allowed to reproduce by rever se transcription into DNA, am- 
plification by PCR, and transcription back into RNA. This new population 
was subjected to additional rounds of selection for ATP-binding activity. 
After eight generations, members of the selected population were charac- 
terized by sequencing. Seventeen different sequences were obtained, 16 of 
which could form the structure shown in Figure 7.24. Each of these mole¬ 
cules bound ATP with high affinity, as indicated by dissociation constants 
less than 50 jaM. 

The folded structure of the ATP-binding region from one of these RNAs 
was determined by nuclear magnetic resonance (Section 4.5.1) methods 
(Figure 7.25). As expected, this 40-nucleotide molecule is composed of two 
Watson-Crick base-paired helical regions separated by an 11-nucleotide 
loop. This loop folds back on itself in an intricate way to form a deep pocket 
into which the adenine ring can fit. Thus, a structure was generated, or 
evolved, that was capable of a specific interaction. 
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FIGURE 7.25 RNA molecule binds ATP. 

(A) The Watson-Crick base-pairing pattern, 

(B) the folding pattern, and (C) a surface 
representation of an RNA molecule selected 
to bind adenosine nucleotides. The bound 
ATP is shown in part B, and the binding 
site is revealed as a deep pocket in part C. 


SUMMARY 


Homologs Are Descended from a Common Ancestor 

Exploring evolution biochemically often means searching for homol- 
ogy, or relatedness, between molecules, because homologous molecules, 
or homologs, evolved from a common ancestor. Paralogs are homolo¬ 
gous molecules that are found in one species and have acquired differ¬ 
ent functions through evolutionary time. Orthologs are homologous 
molecules that are found in different species and have similar or iden- 
tical functions. 

o Statistical Analysis of Sequence Alignments Can Detect Homology 

Protein and nucleic acid sequences are two of the primary languages of 
biochemistry. Sequence-alignment methods are the most powerful tools 
of the evolutionary detective. Sequences can be aligned to maximize 
their similarity, and the significance of these alignments can be judged 
by statistical tests. The detection of a statistically significant alignment 
between two sequences strongly suggests that two sequences are related 
by divergent evolution from a common ancestor. The use of substitu- 
tion matrices makes the detection of more distant evolutionary rela- 
tionships possible. Any sequence can be used to probe sequence data- 
bases to identify related sequences present in the same organism or in 
other organisms. 

o Examination of Three-Dimensional Structure Enhances Our Understanding 
of Evolutionary Relationships 

The evolutionary kinship between proteins may be even more pro- 
foundly evident in the conserved three-dimensional structures. The 
analysis of three-dimensional structure in combination with analysis of 
especially conserved sequences has made it possible to determine evo¬ 
lutionary relationships that are not possible to detect by other means. 
Sequence-comparison methods can also be used to detect imperfectly re- 
peated sequences within a protein, indicative of linked similar domains. 

i) Evolutionary Trees Can Be Constructed on the Basis of Sequence 
Information 

Construction of an evolutionary tree based on sequence comparisons 
revealed approximate times for the gene duplication events separating 
myoglobin and hemoglobin as well as the a and j8 subunits of hemo- 
globin. Evolutionary trees based on sequences can be compared to those 
based on fóssil records. 
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Possible Selected Readings 

The exploration of evolution can also be a laboratory Science. In favor- 

able cases, PCR amplification of well-preserved samples allows the de- 

termination of the nucleotide sequences from extinct organisms. Se- 

quences so determined can help authenticate aspects of an evolutionary 

tree constructed by other means. Molecular evolutionary experiments 

performed in the test tube can examine how molecules such as ligand- 

binding RN A molecules might have been generated. 
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-1 PROBLEMS I- 

1. What's the score? Using the identity-based scoring system 
(Section 7.2), calculate the score for the foliowing alignment. Do 
you think the score is statistically significant? 

(1) WYLGKITRMDAEVLLKKPTVRDGHFLVTQCESSPGEF- 

(2) WYFGKITRRESERLLLNPENPRGTFLVRESETTKGAY- 

SISVRFGDSVQ-HFKVLRDQNGKYYLWAVK-FN- 

CLSVSDFDNAKGLNVKHYKIRKLDSGGFYITSRTQFS- 

SLNELVAYHRTASVSRTHTILLSDMNV 

SSLQQLVAYYSKHADGLCHRLTNV 

2. Sequence and structure. A comparison of the aligned amino 
acid sequences of two proteins each consisting of 150 amino acids 
reveals them to be only 8% identical. However, their three- 
dimensional structures are very similar. Are these two proteins 
related evolutionarily? Explain. 

3. It depends on how you count. Consider the following two se¬ 
quence alignments: 

(a) A-SNLFDIRLIG (b) ASNLFDIRLI-G 

GSNDFYEVKIMD GSNDFYEVKIMD 

Which alignment has a higher score if the identity-based scor¬ 
ing System (Section 7.2) is used? Which alignment has a higher 
score if the Blosum-62 substitution matrix (Figure 7.9) is used? 

4. Discovering a new base pair. Examine the ribosomal RNA 
sequences in Figure 7.19. In sequences that do not contain 
Watson-Crick base pairs, what base tends to be paired with G? 
Propose a structure for your new base pair. 

5. Overwhelmed by numbers. Suppose that you wish to synthe- 
size a pool of RNA molecules that contain all four bases at each 
of 40 positions. How much RNA must you have in grams if the 
pool is to have at least a single molecule of each sequence? The 
average molecular weight of a nucleotide is 330 g moP^. 

6. Form follows function. The three-dimensional structure of 
biomolecules is more conserved evolutionarily than is sequence. 
Why is this the case? 

7. Shuffling. Using the identity-based scoring system (Section 
7.2), calculate the alignment score for the alignment of the fol¬ 
lowing two short sequences: 

(1) ASNFLDKAGK (2) ATDYLEKAGK 


Generate a shuffled version of sequence 2 by randomly reorder- 
ing these 10 amino acids. Align your shuffled sequence with se¬ 
quence 1 without alio wing gaps, and calculate the alignment 
score between sequence 1 and your shuffled sequence. 

8. Interpreting the score. Suppose that the sequences of two pro - 
teins each with 200 amino acids are aligned and that the per- 
centage of identical residues has been calculated. How would you 
interpret each of the following results in regard to the possible 
divergence of the two proteins from a common ancestor? 

(a) 80%, (b) 50%, (c) 20%, (d) 10% 

9. A set of three. The sequences of three proteins (A, B, and C) 
are compared with one another, yielding the following leveis of 
identity: 



A 

B 

C 

A 

100% 

65% 

15% 

B 

65% 

100% 

55% 

C 

15% 

55% 

100% 


Assume that the sequence matches are distributed relatively uni- 
formly along each aligned sequence pair. Would you expect pro- 
tein A and protein C to have similar three-dimensional struc¬ 
tures? Explain. 

10. RNA alignment. Sequences of an RNA fragment from five 
species have been determined and aligned. Propose a likely sec- 
ondary structure for these fragments. 

(1) UUGGAGAUUCGGUAGAAUCUCCC 

(2) GCCGGGAAUCGACAGAUUCCCCG 

(3) CCCAAGUCCCGGCAGGGACUUAC 

(4) CUCACCUGCCGAUAGGCAGGUCA 

(5) AAUACCACCCGGUAGGGUGGUUC 

Media Problem 

11. Evolutionary time machine. It has been suggested that an¬ 
cestral protein sequences might be inferred from evolutionary 
trees of sequences that exist today. The Conceptual Insights 
module on sequence analysis allows you to try your hand at in- 
ferring ancestral sequences from model evolutionary trees. Based 
on this experience, explain why you would not expect to be able 
to successfully infer dinosaur sequences. 
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Enzymes: Basic Concepts and 


Kinetics 





The actívity of an enzyme is responsíble for the glow of the 
lumínescent jeilyfish at left. The enzyme aequorin catalyzes the 
oxidation of a compound by oxygen in the presence of calcium to 
release CO 2 and light. [(Left) Fred Bavendam/Peter Arnold.] 



OUTLINE 


Enzymes, the catalysts of biological systems, are remarkable molecular de- 
vices that determine the patterns of chemical transformations. They also 
mediate the transformation of one form of energy into another. The most 
striking characteristics of enzymes are their catalytic power 
and specificity. Catalysis takes place at a particular site on 
the enzyme called the active site. Nearly all known enzymes 
are proteins. However, proteins do not have an absolute mo- 
nopoly on catalysis; the discovery of catalytically active 
RNA molecules provides compelling evidence that RNA 
was an early biocatalyst (Section 2.2.2). 

Proteins as a class of macromolecules are highly effec- 
tive catalysts for an enormous diversity of chemical reac- 
tions because of their capacity to specifically bind a very wide 
range of molecules. By utilizing the full repertoire of inter- 
molecular forces, enzymes bring substrates together in an 
optimal orientation, the prelude to making and breaking 
chemical bonds. They catalyze reactions by stabilizing tran- 
sition States, the highest-energy species in reaction path- 
ways. By selectively stabilizing a transition state, an enzyme 
determines which one of several potential chemical reac¬ 
tions actually takes place. 


8.1 Enzymes Are PowerfuI and Highly 
Specific Catalysts 

8.2 Free Energy Is a UsefuI 
Thermodynamic Function 
for Understanding Enzymes 

8.3 Enzymes Accelerate Reactions 
by Facilitating the Formation of the 
Transition State 

8.4 The Michaelis-Menten Model 
Accounts for the Kinetic Properties 
of Many Enzymes 

8.5 Enzymes Can Be Inhibited 
by Specific Molecules 

8.6 Vitamins Are Often Precursors 
to Coenzymes 
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8.1 ENZYMES ARE POWERFUL AND HIGHLY 
SPECIFIC CATALYSTS 


o 

II 

C + H 2 O 

o 



Enzymes accelerate reactions by factors of as much as a million or more 
(Table 8.1). Indeed, most reactions in biological systems do not take place 
at perceptible rates in the absence of enzymes. Even a reaction as simple as 
the hydration of carbon dioxide is catalyzed by an enzyme—namely, car- 
bonic anhydrase (Section 9.2). The transfer of CO 2 from the tissues into the 
blood and then to the alveolar air would be less complete in the absence of 
this enzyme. In fact, carbonic anhydrase is one of the fastest enzymes 
known. Each enzyme molecule can hydrate 10^ molecules of CO 2 per sec- 
ond. This catalyzed reaction is 10^ times as fast as the uncatalyzed one. We 
will consider the mechanism of carbonic anhydrase catalysis in Chapter 9. 
Enzymes are highly specific both in the reactions that they catalyze and in 
their choice of reactants, which are called substrates. An enzyme usually cat- 
alyzes a single chemical reaction or a set of closely related reactions. Side re¬ 
actions leading to the wasteful formation of by-products are rare in 
enzyme-catalyzed reactions, in contrast with uncatalyzed ones. 

Let us consider proteolytic enzymes as an example. In vivo, these enzymes 
catalyze proteo/ysis, the hydrolysis of a peptide bond. 




.0 + +H3N. 


O 





Peptide Carboxyl Amino 

component component 

Most proteolytic enzymes also catalyze a different but related reaction in 
vitro—namely, the hydrolysis of an ester bond. Such reactions are more 
easily monitored than is proteolysis and are useful in experimental investi- 
gations of these enzymes (Section 9.1.2). 


/rO HO 

X ^R2 + H2O . S _ + *^2 + H+ 

o O 


Ester Acid Alcohol 

Proteolytic enzymes differ markedly in their degree of substrate speci- 
ficity. Subtilisin, which is found in certain bactéria, is quite undiscriminat- 
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TABLE 8.1 Rate enhancement by selected enzymes 





Uncatalyzed 

Catalyzed 

Rate 


Nonenzymatic 

rate 

rate 

enhancement 

Enzyme 

half-life 


(feunS-^) 

(feca. 

(^cat/^un) 

OMP decarboxylase 

78,000,000 

years 

2.8 X 10“^*’ 

39 

1.4 X 10 '^ 

Staphylococcal nuclease 

130,000 

years 

1.7 X 10“^^ 

95 

5.6 X 10^^ 

AMP nucleosidase 

69,000 

years 

1.0 X 10“” 

60 

6.0 X 10^2 

Carboxypeptidase A 

7.3 

years 

3.0 X 10“’ 

578 

1.9 X 10 " 

Ketosteroid isomerase 

7 

weeks 

1.7 X 10“^ 

66,000 

3.9 X 10 " 

Triose phosphate isomerase 

1.9 

days 

4.3 X 10“^ 

4,300 

1.0 X 10^ 

Chorismate mutase 

7.4 

hours 

2.6 X 10“® 

50 

1.9 X 10* 

Carbonic anhydrase 

5 

seconds 

1.3 X 10“^ 

1 X 10^ 

7.7 X 10* 


Ahhreviations: OMP, orotidine monophosphate; AMP, adenosine monophosphate. 
Source: After A. Radzicka and R. Wofenden. Science 267 (199 5):90-93. 

















ing: it will cleave any peptide bond with little regar d to the identity of the 
adjacent side chains. Trypsin, a digestive enzyme, is quite specific and cat- 
alyzes the splitting of peptide bonds only on the carboxyl side of lysine and 
arginine residues (Figure 8.1 A). Thrombin, an enzyme that participates in 
blood clotting, is even more specific than trypsin. It catalyzes the hydroly- 
sis of Arg-Gly bonds in particular peptide sequences only (Figure 8.1B). 

DNA polymerase I, a template-directed enzyme (Section 27.x.x), is an- 
other highly specific catalyst. It adds nucleotides to a DNA strand that is be- 
ing synthesized, in a sequence determined by the sequence of nucleotides in 
another DNA strand that serves as a template. DNA polymerase I is remark- 
ably precise in carrying out the instructions given by the template. It inserts 
the wrong nucleotide into a new DNA strand less than one in a million times. 

The specxficity of an enzyme is due to the precise interaction of the substrate 
with the enzyme. This precision is a result of the intricate three-dimensional 
structure of the enzyme protein. 

8.1.1 Many Enzymes Require Cofactors for Activity 

The catalytic activity of many enzymes depends on the presence of small 
molecules termed cofactors, although the precise role varies with the cofac- 
tor and the enzyme. Such an enzyme without its cofactor is referred to as an 
apoenzyme; the complete, catalytically active enzyme is called a holoenzyme. 

Apoenzyme + cofactor = holoenzyme 

Cofactors can be subdivided into two groups: metais and small organic mole¬ 
cules (Table 8.2). The enzyme carbonic anhydrase, for example, requires 
Zn^^ for its activity (Section 9.2.1). Glycogen phosphorylase (Section 21.1.5), 
which mobilizes glycogen for energy, requires the small organic molecule 
pyridoxal phosphate (PLP). 




FIGURE 8.1 Enzyme specíficíty. 

(A) Trypsin cleaves on the carboxyl side 
of arginine and lysine residues, whereas 

(B) thrombin cleaves Arg-Gly bonds in 
particular sequences specifically. 
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TABLE 8.2 Enzyme cofactors 


Cofactor 

Enzyme 

Coenzyme 

Thiamine pyrophosphate 

Pyruvate dehydrogenase 

Flavin adenine nucleotide 

Monoamine oxidase 

Nicotinamide adenine dinucleotide 

Lactate dehydrogenase 

Pyridoxal phosphate 

Glycogen phosphorylase 

Coenzyme A (CoA) 

Acetyl CoA carboxylase 

Biotin 

Pyruvate carboxylase 

5'-Deoxyadenosyl cobalamin 

Methylmalonyl mutase 

Tetrahydrofolate 

Thymidylate synthase 

Metal 

Zn2+ 

Carbonic anhydrase 

Zn2+ 

Carboxypeptidase 

Mg2+ 

EcoRV 

Mg2+ 

Hexokinase 


Urease 

Mo 

Nitrate reductase 

Se 

Glutathione peroxidase 

Mn2+ 

Superoxide dismutase 


Propionyl CoA carboxylase 
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the enzyme. If tightly bound, they are called prosthetic groups. Loosely as- 
sociated coenzymes are more like cosubstrates because they bind to and are 
released from the enzyme just as substrates and products are. The use of the 
same coenzyme by a variety of enzymes and their source in vitamins sets 
coenzymes apart from normal substrates, however. Enzymes that use the 
same coenzyme are usually mechanistically similar. In Chapter 9, we will 
examine the mechanistic importance of cofactors to enzyme activity. A more 
detailed discussion of coenzyme vitamins can be found in Section 8.6. 



8.1.2 Enzymes May Transform Energy 
from One Form into Another 

In many biochemical reactions, the energy of the reactants 
is converted with high efficiency into a different form. For ex- 
ample, in photosynthesis, light energy is converted into 
chemical-bond energy through an ion gradient. In mito- 
chondria, the free energy contained in small molecules de¬ 
rived from food is converted first into the free energy of an 
ion gradient and then into a different currency, the free en¬ 
ergy of adenosine triphosphate. Enzymes may then use the 
chemical-bond energy of ATP in many ways. The enzyme 
myosin converts the energy of ATP into the mechanical en¬ 
ergy of contracting muscles. Pumps in the membranes of 
cells and organelles, which can be thought of as enzymes 
that move substrates rather than chemically altering them, 
create chemical and electrical gradients by using the energy 
of ATP to transport molecules and ions (Figure 8.2). The 
molecular mechanisms of these energy -transducing en¬ 
zymes are being unraveled. We will see in subsequent chap- 
ters how unidirectional cycles of discrete steps—binding, 
Chemical transformation, and release—lead to the conver- 
sion of one form of energy into another. 

8.1.3 Enzymes Are Classified on the Basis 

of the Types of Reactions That They Catalyze 


FIGURE 8.2 An energy-transforming enzyme. ATPase Many enzymes have common names that provide little 

uses the energy of ATP hydrolysis to transport Ca across the • r . ♦ u 4 . 4 .* i ü 

. 2+ . Information about the reactions that they catalyze. ror 

membrane, generating a Ca gradient. . 

example, a proteolytic enzyme secreted by the pancreas 

is called trypsin. Most other enzymes are named for their substrates and 

for the reactions that they catalyze, with the suffix ‘‘ase’' added. Thus, an 

ATPase is an enzyme that breaks down ATP, whereas ATP synthase is 

an enzyme that synthesizes ATP. 

To bring some consistency to the classification of enzymes, in 1964 
the International Union of Biochemistry established an Enzyme Commis- 
sion to develop a nomenclature for enzymes. Reactions were divided into 
six major groups numbered 1 through 6 (Table 8.3). These groups were 
subdivided and further subdivided, so that a four-digit number preceded 
by the letters EC for Enzyme Commission could precisely identify all 
enzymes. 

Consider as an example nucleoside monophosphate (NMP) kinase, an 
enzyme that we will examine in detail in the next chapter (Section 9.4). It 
catalyzes the foliowing reaction: 


ATP + NMP 


± ADP + NDP 
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TABLE 8.3 Six major classes of enzymes 


Class 

Type of reaction 

Example 

Chapter 

1. Oxidoreductases 

Oxidation - r eduction 

Lactate dehydrogenase 

16 

2. Transferases 

Group transfer 

Nucleoside monophosphate 
kinase (NMP kinase) 

9 

3. Hydrolases 

Hydrolysis reactions (transfer of 
functional groups to water) 

Chymotrypsin 

9 

4. Lyases 

Addition or removal of groups to form 
double bonds 

Fumarase 

18 

5. Isomerases 

Isomerization (intramolecular group 
transfer) 

Triose phosphate isomerase 

16 

6. Ligases 

Ligation of two substrates at the expense 
of ATP hydrolysis 

Aminoacyl-tRNA synthetase 

29 


NMP kinase transfers a phosphoryl group from ATP to NMP to form a 
nucleoside diphosphate (NDP) and ADR Consequently, it is a transferase, 
or member of group 2. Many groups in addition to phosphoryl groups, such 
as sugars and carbon units, can be transferred. Transferases that shift a phos¬ 
phoryl group are designated 2.7. Various functional groups can accept the 
phosphoryl group. If a phosphate is the acceptor, the transferase is desig¬ 
nated 2.7.4. The final number designates the acceptor more precisely. In re- 
gard to NMP kinase, a nucleoside monophosphate is the acceptor, and the 
enzyme’s designation is EC 2.7.4.4. Although the common names are used 
routinely, the classification number is used when the precise identity of the 
enzyme might be ambiguous. 


8.2 FREE ENERGY IS A USEFUL THERMODYNAIVIIC 
FUNCTION FOR UNDERSTANDING ENZYMES 

Some of the principies of thermodynamics were introduced in Chapter 1— 
notably the idea oífree energy (G).To fully understand how enzymes oper- 
ate, we need to consider two thermodynamic properties of the reaction: 
(1) the free-energy difference (AG) between the products and reactants 
and (2) the energy required to initiate the conversion of reactants to prod¬ 
ucts. The former determines whether the reaction will be spontaneous, 
whereas the later determines the rate of the reaction. Enzymes affect only 
the latter. First, we will consider the thermodynamics of reactions and then, 
in Section 8.3, the rates of reactions. 

8.2.1 The Free-Energy Change Provides Information About the 
Spontaneity but Not the Rate of a Reaction 

As stated in Section 1.3.3, the free-energy change of a reaction (AG) tells us 
if the reaction can occur spontaneously: 

1. A reaction can occur spontaneously only if AG is negative. Such reactions 
are said to be exergonic. 

2. A System is at equilibrium and no net change can take place if AG is zero. 

3. A reaction cannot occur spontaneously if AG is positive. An input of free 
energy is required to drive such a reaction. These reactions are termed 
endergonic. 
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Units of energy— 

A calorie (cal) is equivalent to the 
amount of heat required to raise the 
temperature of 1 gram of water from 
14.5°C to 15.5°C 
A kilocalorie (kcal) is equal to 
1000 cal. 

A joule (J) is the amount of energy 
needed to appiy a 1-newton force over 
a distance of 1 meter. 

A kilojoule (kJ) is equal to 1000 J. 

1 kcal = 4.184 kJ 


Two additional points need to be emphasized. The AG of a reaction de- 
pends only on the free energy of the products (the final State) minus the free 
energy of the reactants (the initial State). The AG of a reaction is independ- 
ent of the path (or molecular mechanism) of the transformation. The mecha- 
nism of a reaction has no effect on AG. For example, the AG for the oxida- 
tion of glucose to CO 2 and H 2 O is the same whether it occurs by combustion 
in vitro or by a series of enzyme-catalyzed steps in a cell. The AG provides 
no information about the rate of a reaction. A negative AG indicates that a 
reaction can occur spontaneously, but it does not signify whether it will pro- 
ceed at a perceptible rate. As will be discussed shortly (Section 8.3), the rate 
of a reaction depends on the free energy of activation (AG"^), which is largely 
unrelated to the AG of the reaction. 

8.2.2 The Standard Free-Energy Change of a Reaction Is 
Related to the Equilibrium Constant 

As for any reaction, we need to be able to determine AG for an enzyme- 
catalyzed reaction in order to know whether the reaction is spontaneous or 
an input of energy is required. To determine this important thermodynamic 
parameter, we need to take into account the nature of both the reactants and 
the products as well as their concentrations. 

Consider the reaction 


A + + D 


The AG of this reaction is given by 


AG = AG° + RTln 


[C][D] 

[A][B] 


( 1 ) 


in which AG° is the standard free-energy change, R is the gas constant, T is 
the absolute temperature, and [A], [B], [C], and [D] are the molar concen¬ 
trations (more precisely, the activities) of the reactants. AG° is the free- 
energy change for this reaction under standard conditions—that is, when 
each of the reactants A, B, C, and D is present at a concentration of 1.0 M 
(for a gas, the standard state is usually chosen to be 1 atmosphere). Thus, 
the AG of a reaction depends on the nature of the reactants (expressed in the 
AG° term of equation 1) and on their concentrations (expressed in the log- 
arithmic term of equation 1). 

A convention has been adopted to simplify free-energy calculations for 
biochemical reactions. The standard state is defined as having a pH of 7. 
Consequently, when is a reactant, its activity has the value 1 (corre- 
sponding to a pH of 7) in equations 1 and 4 (below). The activity of water 
also is taken to be 1 in these equations. The standard free-energy change at 
pH 7, denoted by the symbol AG°' will be used throughout this book. The 
kilocalorie (abbreviated kcal) and the kilojoule (kJ) will be used as the units 
of energy. One kilocalorie is equivalent to 4.184 kilojoules. 

The relation between the standard free energy and the equilibrium con¬ 
stant of a reaction can be readily derived. This equation is important because 
it displays the energetic relation between products and reactants in terms of 
their concentrations. At equilibrium, AG = 0. Equation 1 then becomes 


0 = AG°' 


+ RTln 


[C][D] 

[A][B] 


( 2 ) 


and so 


AG°' 


-RTln 


[C][D] 

[A][B] 


( 3 ) 












The equilibrium constant under standard conditions, is defined as 


[C][D] 

[A][B] 


( 4 ) 


Substituting equation 4 into equation 3 gives 
AG°' = -RTlnK'eq 
AG°' = -2.303RTlogioiC'eq 
which can be rearranged to give 

_ |q-AG°7(2.303RT) 


( 5 ) 

( 6 ) 

( 7 ) 


Substituting R = 1.987 X 10 ^ kcal mol ^ deg ^ and T = 298 K (corre- 
sponding to 25°C) gives 


K' 


eq 


|q-AG°71.36 


( 8 ) 


where AG°' is here expressed in kilocalories per mole because of the choice 
of the units for R in equation 7. Thus, the standard free energy and the equi¬ 
librium constant of a reaction are related by a simple expression. For ex- 
ample, an equilibrium constant of 10 gives a standard free-energy change 
of —1.36 kcal moP^ ( — 5.69 kj moP^) at 25°C (Table 8.4). Note that, for 
each 10-fold change in the equilibrium constant, the AG°' changes by 
1.36 kcal moP^ (5.69 kJ moP^). 

As an example, let us calculate AG°' and AG for the isomerization of 
dihydroxyacetone phosphate (DHAP) to glyceraldehyde 3-phosphate 
(GAP). This reaction takes place in glycolysis (Section 16.1.4). At equilibrium, 
the ratio of GAP to DHAP is 0.0475 at 25°C (298 K) and pH 7. Hence, 
— 0.0475. The standard free-energy change for this reaction is then 
calculated from equation 6: 

AG°' = -2.303]?TlogioíC'eq 

= -2.303 X 1.987 X 10“^ X 298 X logio(0.0475) 

= +1.80 kcal moD^(7.53 kJ moD^) 


Under these conditions, the reaction is endergonic. DHAP will not spon- 
taneously convert to GAP. 

Now let us calculate AG for this reaction when the initial concentration 
of DHAP is2XlO~"^M and the initial concentration of GAP is 3 X 
10~^ M. Substituting these values into equation 1 gives 

3 X 10~^M 

AG = 1.80 kcal moU^ + 2.303RTlogio- ttt 

^^^2 X 10“^ M 

= 1.80 kcal moU^ — 2.49 kcal moU^ 

= —0.69 kcal moU^ ( — 2.89 kJ moU^) 

This negative value for the AG indicates that the isomerization of DHAP 
to GAP is exergonic and can occur spontaneously when these species are 
present at the aforestated concentrations. Note that AG for this reaction is 
negative, although AG is positive. It is important to stress that whether the 
AG for a reaction is larger, smaller, or the same as AG depends on the con¬ 
centrations of the reactants and products. The criterion of spontaneity for a 
reaction is AG, not AG . This point is important because reactions that are 
not spontaneous based on AG can be made spontaneous by adjusting the 
concentrations of reactants and products. This principie is the basis of the 
coupling of reactions to form metabolic pathways (Chapter 14). 


r 


TABLE 8.4 Relation between AG""' 
and (at 25X) 


k;. 


AG°' 

kcal mol 

kJ/mol“^ 

10“^ 

6.82 

28.53 

10““* 

5.46 

22.84 

10“^ 

4.09 

17.11 

10“2 

2.73 

11.42 

10“' 

1.36 

5.69 

1 

0 

0 

10 

-1.36 

-5.69 

10^ 

-2.73 

-11.42 

10^ 

-4.09 

-17.11 

IO"* 

-5.46 

-22.84 

10^ 

-6.82 

-28.53 


O 

II 

HO^ /C ^OPO,^- 

H2 H2 

Dihydroxyacetone 

phosphate 

(DHAP) 


I H 
H 


OH 

I 


"C 

Hz 


/OPOj^ 


Glyceraldehyde 

3-phosphate 

(GAP) 
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Transition State, 



Product 


Reaction progress —> 

FIGURE 8.3 Enzymes decrease the 
actívatíon energy. Enzymes accelerate 
reactions by decreasing AC*", the free 
energy of activation. 


8.2.3 Enzymes Alter Only the Reaction Rate and Not the 
Reaction Equilibrium 

Because enzymes are such superb catalysts, it is tempting to ascribe to them 
powers that they do not have. An enzyme cannot alter the laws of thermo- 
dynamics and consequently cannot alter the equilibrium of a chemical reac¬ 
tion. This inability means that an enzyme accelerates the forward and re¬ 
ver se reactions by precisely the same factor. Consider the interconversion 
of A and B. Suppose that, in the absence of enzyme, the forward rate con- 
stant (fep) is s~^ and the reverse rate constant (fe^) is s~\ The 
equilibrium constant K is given by the ratio of these rate constants: 


A 




B 


M _ ^ 

[a] /jr 10 ^ 


The equilibrium concentration of B is 100 times that of A, whether or not 
enzyme is present. However, it might take considerable time to approach 
this equilibrium without enzyme, whereas equilibrium would be attained 
rapidly in the presence of a suitable enzyme. Enzymes accelerate the attain- 
ment of equilibria but do not shift their positions. The equilibrium position is 
a function only of the free-energy difference between reactants and products. 


8.3 ENZYMES ACCELERATE REACTIONS BY FACILITATING 
THE FORMATION OF THE TRANSITION STATE 

The free-energy difference between reactants and products accounts for the 
equilibrium of the reaction, but enzymes accelerate how quickly this equi¬ 
librium is attained. How can we explain the rate enhancement in terms of 
thermodynamics? To do so, we have to consider not the end points of the 
reaction but the chemical pathway between the end points. 

A chemical reaction of substrate S to form product P goes through a 
transition State that has a higher free energy than does either S or P. The 
double dagger denotes a thermodynamic property of the transition state. 
The transition state is the most seldom occupied species along the reaction 
pathway because it is the one with the highest free energy. The difference 
in free energy between the transition state and the substrate is called the 
Gibbs free energy of activation or simply the activation energy, symbolized 
by AGf as mentioned in Section 8.2.1 (Figure 8.3). 

AG^ = Gst - Gs 

Note that the energy of activation, or AG"^, does not enter into the final AG 
calculation for the reaction, because the energy input required to reach the 
transition state is returned when the transition state forms the product. The 
activation-energy barrier immediately suggests how enzymes enhance re¬ 
action rate without altering AG of the reaction: enzymes function to lower 
the activation energy, or, in other words, enzymes facilitate the formation of 
the transition state. 

One approach to understanding how enzymes achieve this facilitation is 
to assume that the transition state (S"*") and the substrate (S) are in equilibrium. 

^ V 

S ->P 

in which is the equilibrium constant for the formation of and v is the 
rate of formation of product from 
















The rate of the reaction is proportional to the concentration of 

Rate ^ [S^] 

because only can be converted into product. The concentration of is 
in turn related to the free energy difference between and S, because these 
two Chemical species are assumed to be in equilibrium; the greater the dif¬ 
ference between these two States, the smaller the amount of 

Because the reaction rate is proportional to the concentration of and 
the concentration of depends on AG"^, the rate of reaction V depends on 
AG"^. Specifically, 

h 

In this equation, k is Boltzmann s constant, and h is Planck^s constant. The 
value of feT/h at 25°C is 6.2 X 10^^ s~b Supposethat the free energy ofac- 
tivation is 6.82 kcal moP^ (28.53 kj moP^). The ratio [S'^]/[S] is then 
(see Table 8.4). If we assume for simplicity s sake that [S] = IM, then the 
reaction rate V is 6.2 X 10^ s~b If AG"^ were lowered by 1.36 kcal mol ^ 
(5.69 kJ moP^), the ratio [S'^]/[S] is then and the reaction rate would 
be 6.2 X 10^ s~\ As Table 8.4 shows, a decrease of 1.36 kcal moP^ in AG"^ 
yields a tenfold larger V. A relatively small decrease in AG"^ (20% in this 
particular reaction) results in a much greater increase in V. 

Thus, we see the key to how enzymes operate: Enzymes accelerate reac- 
tions by decreasing , the activation energy. The combination of substrate 
and enzyme creates a new reaction pathway whose transition-state energy 
is lower than that of the reaction in the absence of enzyme (see Figure 8.3). 
The lower activation energy means that more molecules have the required 
energy to reach the transition state. Decreasing the activation barrier is anal- 
ogous to lowering the height of a high-jump bar; more athletes will be able 
to clear the bar. The essence of catalysis is specific binding of the transition 
State. 

8.3.1 The Formation of an Enzyme-Substrate Complex Is the 
First Step in Enzymatic Catalysis 

Much of the catalytic power of enzymes comes from their bringing sub- 
strates together in favorable orientations to promote the formation of the 
transition States in enzyme-substrate (ES) complexes. The substrates are 
bound to a specific region of the enzyme called the active site. Most enzymes 
are highly selective in the substrates that they bind. Indeed, the catalytic 
specificity of enzymes depends in part on the specificity of binding. 

What is the evidence for the existence of an enzyme-substrate complex? 

1. The first clue was the observation that, at a constant concentration of 
enzyme, the reaction rate increases with increasing substrate concentration 
until a maximal velocity is reached (Figure 8.4). In contrast, uncatalyzed re- 
actions do not show this saturation effect. Thefact that an enzyme-catalyzed 
reaction has a maximal velocity suggests the formation of a discrete ES com¬ 
plex. At a sufficiently high substrate concentration, all the catalytic sites are 
filled and so the reaction rate cannot increase. Although indirect, this is the 
most general evidence for the existence of ES complexes. 

2. X-ray crystallography has provided high-resolution images of substrates 
and substrate analogs bound to the active sites of many enzymes (Figure 8.5). 
In Chapter 9, we will take a dose look at several of these complexes. X-ray 
studies carried out at low temperatures (to slow reactions down) are provid- 
ing revealing views of enzyme-substrate complexes and their subsequent 
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"I think that enzymes are molecules 
that are complementary in structure to 
the activated complexes of the reactions 
that they catalyze, that is, to the 
molecular configuration that is 
intermediate between the reacting 
substances and the products of 
reaction for these catalyzed processes. 
The attraction of the enzyme molecule 
for the activated complex would thus 
lead to a decrease in its energy and 
hence to a decrease in the energy of 
activation of the reaction and to an 
increase in the rate of reaction." 

—Linus Pauling 
Nature 161 (1948) :707 



Substrate concentration 


FIGURE 8.4 Reaction velocity versus 
substrate concentration in an enzyme- 
catalyzed reaction. An enzyme-catalyzed 
reaction reaches a maximal velocity. 
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FIGURE 8.5 Structure of an enzyme-substrate complex. (Left) The enzyme 
cytochrome P-450 is illustrated bound to its substrate camphor. (Right) In the active site, 
the substrate is surrounded by residues from the enzyme. Note aiso the presence of a 
heme cofactor. 



FIGURE 8.6 Change in spectroscopíc 
characterístícs with the formation 
of an enzyme-substrate complex. 

Fluorescence intensity of the pyridoxal 
phosphate group at the active site of 
tryptophan synthetase changes on addition 
of serine and indole, the substrates. 


reactions. A new technique, time-resolved crystallography, depends on co- 
crystallizing a photolabile substrate analog with the enzyme. The substrate 
analog can be converted to substrate light, and images of the enzyme- 
substrate complex are obtained in a fraction of a second by scanning the 
crystal with intense, polychromatic x-rays from a synchrotron. 

3. The spectroscopíc characterístícs of many enzymes and substrates change 
on formation of an ES complex. These changes are particularly striking if 
the enzyme contains a colored prosthetic group. Tryptophan synthetase, a 
bacterial enzyme that contains a pyridoxal phosphate (PLP) prosthetic 
group, provides a nice illustration. This enzyme catalyzes the synthesis of 
L- tryptophan from L- serine and indole-derivative. The addition of L- serine 
to the enzyme produces a marked increase in the fluorescence of the PLP 
group (Figure 8.6). The subsequent addition of indole, the second substrate, 
quenches this fluorescence to a levei even lower than that of the enzyme 
alone. Thus, fluorescence spectroscopy reveals the existence of an en¬ 
zyme—serine complex and of an enzyme—serine-indole complex. Other 
spectroscopíc techniques, such as nuclear magnetic resonance and electron 
spin resonance, also are highly informative about ES interactions. 

8.3.2 The Active Sites of Enzymes Have Some Common Features 

The active site of an enzyme is the region that binds the substrates (and the 
cofactor, if any). It also contains the residues that directly participate in the 
making and breaking of bonds. These residues are called the catalytíc groups. 
In essence, the ínteractíon of the enzyme and substrate at the active site pro- 
motes the formation of the transition State. The active site is the region of the 
enzyme that most directly lowers the AG"*" of the reaction, which results in 
the rate enhancement characteristic of enzyme action. Although enzymes 
differ widely in structure, specificity, and mode of catalysis, a number of 
generalizations concerning their active sites can be stated: 

1. The active site is a three-dimensional cleft formed by groups that come 
from different parts of the amino acid sequence—indeed, residues far apart 
in the sequence may interact more strongly than adjacent residues in the 
amino acid sequence. In lysozyme, an enzyme that degrades the cell walls 
of some bactéria, the important groups in the active site are contributed by 






(A) 



(B) II II III II I h C 

1 35 52 62,63 101 108 129 
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FIGURE 8.7 Active sites may 
include distant residues. (A) Ribbon 
diagram of the enzyme lysozyme with 
several components of the active site 
shown in color. (B) A schematic 
representation of the primary structure of 
lysozyme shows that the active site is 
composed of residues that come from 
different parts of the polypeptide chain. 


residues numbered 35, 52, 62, 63, and 101 in the sequence of the 129 amino 
acids (Figure 8.7). 

2. The active site takes up a relatively small part of the total volume of an 
enzyme. Most of the amino acid residues in an enzyme are not in contact 
with the substrate, which raises the intriguing question of why enzymes are 
so big. Nearly all enzymes are made up of more than 100 amino acid 
residues, which gives them a mass greater than 10 kd and a diameter of more 
than 25 A. The ‘‘extra'' amino acids serve as a scaffold to create the three- 
dimensional active site from amino acids that are far apart in the primary 
structure. Amino acids near to one another in the primary structure are of- 
ten sterically constrained from adopting the structural relations necessary 
to form the active site. In many proteins, the remaining amino acids also 
constitute regulatory sites, sites of interaction with other proteins, or chan- 
nels to bring the substrates to the active sites. 

3. Active sites are clefts or crevices. In all enzymes of known structure, sub¬ 
strate molecules are bound to a cleft or crevice. Water is usually excluded 
unless it is a reactant. The nonpolar character of much of the cleft enhances 
the binding of substrate as well as catalysis. Nevertheless, the cleft may also 
contain polar residues. In the nonpolar microenvironment of the active site, 
certain of these polar residues acquire special properties essential for sub¬ 
strate binding or catalysis. The internai positions of these polar residues are 
biologically crucial exceptions to the general rule that polar residues are ex- 
posed to water. 

4. Substrates are bound to enzymes by multiple weak attractions. ES com- 
plexes usually have equilibrium constants that range from 10~^ to 10~^ M, 
corresponding to free energies of interaction ranging from about —3 to 
—12 kcal mol~^ (from —13 to — 50 kj mol~^). The noncovalent interactions 
in ES complexes are much weaker than covalent bonds, which have ener¬ 
gies between —50 and —110 kcal moP^ (between —210 and —460 kJ 
moP^). As discussed in Chapter 1 (Section 1.3.1), electrostatic interactions, 
hydrogen bonds, van der Waals forces, and hydrophobic interactions me- 
diate reversible interactions of biomolecules. Van der Waals forces become 
significant in binding only when numerous substrate atoms simultaneously 
come dose to many enzyme atoms. Hence, the enzyme and substrate should 
have complementary shapes. The directional character of hydrogen bonds 
between enzyme and substrate often enforces a high degree of specificity, 
as seen in the RNA-degrading enzyme ribonuclease (Figure 8.8). 



H 

/O 

/ Serine 

-C side chain 

H2 


FIGURE 8.8 Hydrogen bonds between 
an enzyme and substrate. The enzyme 
ribonuclease forms hydrogen bonds with 
the uridine component of the substrate. 
[After F. M. Richards, H. W. Wyckoff, and N. Allewel. 
In The Neurosciences: Second Study Program, 

F. o. Schmidt, Ed. (Rockefeller University Press, 
1970), p. 970.] 
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FIGURE 8.9 Lock-and-key model of 
enzyme-substrate bíndíng. In this 
model, the active site of the unbound 
enzyme is complementary in shape to 
the substrate. 



Substrate concentration [S] —^ 


FIGURE 8.11 Míchaelís-Menten kinetics. 

A plot of the reaction velocity (I/q) as a 
function of the substrate concentration [S] 
for an enzyme that obeys Michaelis-Menten 
kinetics shows that the maximal velocity 
('/max) is approached asymptotically. The 
Michaelis constant (/Cm) is the substrate 
concentration yielding a velocity of l/max/2. 



5. The specxficity of binding depends on the precisely defined arrangement of 
atoms in an active site. Because the enzyme and the substrate interact by 
means of short-range forces that require dose contact, a substrate must have 
a matching shape to fit into the site. Emil Fischer^s analogy of the lock and 
key (Figure 8.9), expressed in 1890, has proved to be highly stimulating and 
fruitful. However, we now know that enzymes are flexible and that the 
shapes of the active sites can be markedly modified by the binding of sub¬ 
strate, as was postulated by Daniel E. Koshland, Jr., in 1958. The active site 
of some enzymes assume a shape that is complementary to that of the tran- 
sition State only after the substrate is bound. This process of dynamic recog- 
nition is called inducedfit (Figure 8.10). 


Substrate 


ES complex 


Enzyme 




FIGURE 8.10 Induced-fit model of enzyme-substrate binding. In this model, the 
enzyme changes shape on substrate binding. The active site forms a shape complementary 
to the substrate only after the substrate has been bound. 


8.4 THE MICHAELIS-IVIENTEN MODEL ACCOUNTS 
FOR THE KINETIC PROPERTIES OF MANY ENZYIVIES 

The primary function of enzymes is to enhance rates of reactions so that 
they are compatible with the needs of the organism. To understand how en¬ 
zymes function, we need a kinetic description of their activity. For many en¬ 
zymes, the rate of catalysis Vq, which is defined as the number of moles of 
product formed per second, varies with the substrate concentration [S] in a 
manner shown in Figure 8.11. The rate of catalysis rises linearly as substrate 
concentration increases and then begins to levei off and approach a maxi- 
mum at higher substrate concentrations. Before we can accurately interpret 









this graph, we need to understand how it is generated. Consider an enzyme 
that catalyzes the S to P by the foliowing pathway: 

fel ^2 

E + + P 

fe-l fe_2 

The extent of product formation is determined as a function of time for a 
series of substrate concentrations (Figure 8.12). As expected, in each case, 
the amount of product formed increases with time, although eventually a 
time is reached when there is no net change in the concentration of S or P. 
The enzyme is still actively converting substrate into product and visa versa, 
but the reaction equilibrium has been attained. Figure 8.13A illustrates the 
changes in concentration observed in all of the reaction participants with 
time until equilibrium has been reached. 

Enzyme kinetics are more easily approached if we can ignore the back 
reaction. We define Vq as the rate of increase in product with time when [P] 
is low; that is, at times dose to zero (hence, Vq) (Figure 8.13B). Thus, for 
the graph in Figure 8.11, Vq is determined for each substrate concentration 
by measuring the rate of product formation at early times before P accu- 
mulates (see Figure 8.12). 

We begin our kinetic examination of enzyme activity with the graph 
shown in Figure 8.11. At a fixed concentration of enzyme, Vq is almost lin- 
early proportional to [S] when [S] is small but is nearly independent of [S] 
when [S] is large. In 1913, Leonor Michaelis and Maud Menten proposed 
a simple model to account for these kinetic characteristics. The criticai fea- 
ture in their treatment is that a specific ES complex is a necessary interme- 
diate in catalysis. The model proposed, which is the simplest one that ac- 
counts for the kinetic properties of many enzymes, is 

E + S^ES^E + P (9) 

fe-i 

An enzyme E combines with substrate S to form an ES complex, with a rate 
constant ki. The ES complex has two possible fates. It can dissociate to E 
and S, with a rate constant fe_i, or it can proceed to form product P, with a 
rate constant fe 2 . Again, we assume that almost none of the product reverts 
to the initial substrate, a condition that holds in the initial stage of a reac¬ 
tion before the concentration of product is appreciable. 

We want an expression that relates the rate of catalysis to the concen¬ 
trations of substrate and enzyme and the rates of the individual steps. Our 



FIGURE 8.12 Determiníng initial 
velocity. The amount of product formed at 
different substrate concentrations is plotted 
as a function of time. The initial velocity 
(l/o) for each substrate concentration is 
determined from the slope of the curve at 
the beginning of a reaction, when the 
reverse reaction is insignificant. 


FIGURE 8.13 Changes in the 
concentration of reaction participants 
of an enzyme-catalyzed reaction with 
time. Concentration changes under 
(A) steady-state conditions, and (B) the 
pre-steady-state conditions. 
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CHAPTER 8 • Basic Concepts and Kinetics centration of the ES complex and ^ 2 - 

Vo = fe2[ES] (10) 

Now we need to express [ES] in terms of known quantities. The rates of for- 
mation and breakdown of ES are given by: 

Rate of formation of ES = [E] [S] (11) 

Rate of breakdown of ES = (fe_i + fe 2 )[ES] (12) 

To simplify matters, we will work under the steady-state assumption. In a 
steady State, the concentrations of intermediates, in this case [ES], stay the 
same even if the concentrations of starting materiais and products are 
changing. This occurs when the rates of formation and breakdown of the 
ES complex are equal. Setting the right-hand sides of equations 11 and 12 
equal gives 

fejElS]=(fe_i + fe2)[ES] (13) 

By rearranging equation 13, we obtain 

[E][S]/[ES] = (fe_i + k2)/h (14) 

Equation 14 can be simplified by defining a new constant, K^, called the 
Michaelis constant: 


Ku = 


fe 1 + feo 


fel 


(15) 


Note that Rm has the units of concentration. is an important charac- 
teristic of enzyme-substrate interactions and is independent of enzyme and 
substrate concentrations. 

Inserting equation 15 into equation 14 and solving for [ES] yields 


[ES] 


[E][S] 


(16) 


Now let us examine the numerator of equation 16. The concentration of 
uncombined substrate [S] is very nearly equal to the total substrate con¬ 
centration, provided that the concentration of enzyme is much lower than 
that of substrate. The concentration of uncombined enzyme [E] is equal 
to the total enzyme concentration [E]t minus the concentration of the ES 
complex. 

[E] = [e]t - [ES] (17) 


Substituting this expression for [E] in equation 16 gives 

,gg, ^ (|E]t - |ES|)|S| 

Km 

Solving equation 18 for [ES] gives 

[ES] = 

1 + 1s]/Km 


or 

[s] 

[s] + Ku 


(18) 


(19) 


[ES] = [E]x 


( 20 ) 








By substituting this expression for [ES] into equation 10, we obtain 

, , [s] 

Vo = fc;|E|T |„|‘ V (21) 

[o\ + ÍVm 

The maximal rate, V^ax» is attained when the catalytic sites on the en- 
zyme are saturated with substrate—that is, when [ES] = [E]^. Thus, 

V„ax = hlElr (22) 

Substituting equation 22 into equation 21 yields the Michaelis-Menten 
equation: 

Vo = V^,. (23) 

[bj + Km 

This equation accounts for the kinetic data given in Figure 8.11. At very 
low substrate concentration, when [S] is much less than Kmj Vq ~ 
(Vmax/Í^ m)/[S]; that is, the rate is directly proportional to the substrate con¬ 
centration. At high substrate concentration, when [S] is much greater than 
Km^ ^0 ~ V^max) is, the rate is maximal, independent of substrate 
concentration. 

The meaning of Km is evident from equation 23. When [S] = Km, then 

“ V^max/2. Thus, Km is equal to the substrate concentration at which the 
reaction rate is half its maximal value. Km is an important characteristic of 
an enzyme-catalyzed reaction and is significant for its biological function. 

The physiological consequence of Km is illustrated by the sensitivity of 
some individuais to ethanol. Such persons exhibit facial flushing and rapid 
heart rate (tachycardia) after ingesting even small amounts of alcohol. In the 
liver, alcohol dehydrogenase converts ethanol into acetaldehyde. 
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Alchohol 

4- dchydroffcnâsc 4- 

CH3CH2OH + NAD ^ CH3CHO + H + NADH 

Normally, the acetaldehyde, which is the cause of the symptoms when 
present at high concentrations, is processed to acetate by acetaldehyde 
dehydrogenase. 


Acetaldehyde 

4- dchydroscnâsc — 4- 

CH3CHO + NAD - CH3COO +NADH + 2H 

Most people have two forms of the acetaldehyde dehydrogenase, a low Km 
mitochondrial form and a high Km cytosolic form. In susceptible persons, the 
mitochondrial enzyme is less active due to the substitution of a single amino 
acid, and acetaldehyde is processed only by the cytosolic enzyme. Because 
this enzyme has a high Km, l^ss acetaldehyde is converted into acetate; excess 
acetaldehyde escapes into the blood and accounts for the physiological effects. 

8 . 4.1 The Significance of Kf^ and l/^ax Values 


CONCEPTUAL INSIGHTS, Steady-State Enzyme Kinetics. Learn how the kinetic 
parameters Km and V^ax can be determined experimentally using the enzyme kinetics lab 
simulation in this media module. 

The Michaelis constant. Km, and the maximal rate, V^ax, can be readily 
derived from rates of catalysis measured at a variety of substrate concen¬ 
trations if an enzyme operates according to the simple scheme given in 
equation 23. The derivation of Km and V^ax is most commonly achieved 











-1 204 I- 

CHAPTER 8 • Basic Concepts and Kinetics 


r 


TABLE 8.5 values of some enzymes 


Enzyme 

Substrate 

KM(fiiM) 

Chymotrypsin 

Acety 1 - L - try ptophanamide 

5000 

Lysozyme 

Hexa-N-acetylglucosamine 

6 

P-Galactosidase 

Lactose 

4000 

Threonine deaminase 

Threonine 

5000 

Carbonic anhydrase 

CO 2 

8000 

Penicillinase 

Benzylpenicillin 

50 

Pyruvate carboxylase 

Pyruvate 

400 


HCO 3 - 

1000 


ATP 

60 

Arginine-tRNA synthetase 

Arginine 

3 


tRNA 

0.4 


ATP 

300 


with the use of curve-fitting programs on a Computer (see the appendix to 
this chapter for alternative means of determining Km and The Km 

values of enzymes range widely (Table 8 . 5 ). For most enzymes, Km lies be- 
tween and M. The Km value for an enzyme depends on the par¬ 
ticular substrate and on environmental conditions such as pH, temperature, 
and ionic strength. The Michaelis constant, Km, has two meanings. First, 
Km is the concentration of substrate at which half the active sites are filled. 
Thus, Km provides a measure of the substrate concentration required for 
significant catalysis to occur. In fact, for many enzymes, experimental evi- 
dence suggests that Km provides an approximation of substrate concentra¬ 
tion in vivo. When the Km is known, the fraction of sites filled, /es, at any 
substrate concentration can be calculated from 


V [s] 
V^ax [s] + Ku 


(24) 


Second, Km is related to the rate constants of the individual steps in the 
catalytic scheme given in equation 9 . In equation 15 , Km is defined as 
{k-i + k2)/ki. Consider a limiting case in which k-i is much greater than 
fe2. Under such circumstances, the ES complex dissociates to E and S much 
more rapidly than product is formed. Under these conditions (fe-i >> ^2), 





(25) 


The dissociation constant of the ES complex is given by 

_[E][S]_fe_i 
[ES] fel 


( 26 ) 


In other words. Km is equal to the dissociation constant of the ES complex if 
k2 is much smaller than k_2. When this condition is met. Km is a measure 
of the strength of the ES complex: a high Km indicates weak binding; a low 
Km indicates strong binding. It must be stressed that Km indicates the affin- 
ity of the ES complex only when fe_i is much greater than fe2. 

The maximal rate, V^ax, reveals the turnover number of an enzyme, which 
is the number of substrate molecules converted into product by an enzyme mole- 
cule in a unit time when the enzyme is fully saturated with substrate. It is equal 
to the kinetic constant fe2, which is also called fe^af The maximal rate, V^ax» 
reveals the turnover number of an enzyme if the concentration of active sites 












[E]x is known, because 

V^max ^ fe2[E]T 

and thus 

k 2 = V^j[E]j ( 27 ) 

For example, a M solution of carbonic anhydrase catalyzes the for- 
mation of 0.6 M H2CO3 per second when it is fully saturated with sub¬ 
strato. Hence, fe2 is 6 X 10 ^ s~^. This turnover number is one of the largest 
known. Each catalyzed reaction takes place in a time equal to l/fe2, which 
is 1.7 jULS for carbonic anhydrase. The turnover numbers of most enzymes 
with their physiological substratos fali in the range from 1 to per sec¬ 
ond (Table 8.6). 

8.4.2 Kinetic Perfection in Enzymatic Catalysis: 

The kcat/Kf^ Criterion 

When the substrato concentration is much greater than the rate of cata¬ 
lysis is equal to fecat» the turnover number, as described in Section 8 . 4 . 1 . 
However, most enzymes are not normally saturated with substrato. Under 
physiological conditions, the [S]/ÍCm ratio is typically between 0.01 and 1.0. 
When [S] << Km) the enzymatic rate is much less than fe^at because most 
of the active sites are unoccupied. Is there a number that characterizes the 
kinetics of an enzyme under these more typical cellular conditions? Indeed 
there is, as can be shown by combining equations 10 and 16 to give 

Vo = -^[ElS] ( 28 ) 

When [S] << Km» the concentration of free enzyme, [E], is nearly equal to 
the total concentration of enzyme [Ex], so 

Vo = |^[sIe]t ( 29 ) 

Thus, when [S] << the enzymatic velocity depends on the values of 
fecat/í^M> [S]> [E]x. Under these conditions, fecat/^M is the rate constant 

for the interaction of S and E and can be used as a measure of catalytic ef- 
ficiency. For instance, by using fecat/í^M values, one can compare an en¬ 
zymes preference for different substrates. Table 8.7 shows the fecat/í^M 


r 


TABLE 8.6 Maximum turnover 
numbers of some enzymes 


Enzyme 

Turnover 

number 

(per 

second) 

Carbonic anhydrase 

600,000 

3-Ketosteroid 

280,000 

isomerase 

Acetylcholinesterase 

25,000 

Penicillinase 

2,000 

Lactate 

1,000 

dehydrogenase 

Chymotrypsin 

100 

DNA polymerase I 

15 

Tryptophan synthetase 2 

Lysozyme 

0.5 
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TABLE 8.7 Substrate preferentes of chymotrypsin 


Amino acid in ester 

Amino acid side chain 

fecat/KM (S“' M-1) 

Glycine 

— H 

CIÍ3 

—CH 

\ 

CH 3 

1.3 X 10“^ 

Valine 

2.0 

Norvaline 

—CH 2 CH 2 GH 3 

3.6 X 10^ 

Norleucine 

—CH 2 CH 2 GH 2 CH 3 

3.0 X 10^ 

Phenylalanine 


1.0 X 10® 


Source: After A. Fersht, Structure and Mechanism in Protein Science: A Guide to Enzyme 
Catalysis and Protein Folding (W. H. Freeman and Company, 1999), Table 7.3. 
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Circe effect— 

The utilization of attractive forces to 
lure a substrate into a site in which 
it undergoes a transformation of 
structure, as defined by William P. 
Jencks, an enzymologist, who coined 
the term. 

A goddess of Greek mythology, Circe 
lured Odysseus's men to her house 
and then transformed them into pigs. 


values for several different substrates of chymotrypsin (Section 9 . 1 . 1 ). Chy- 
motrypsin clearly has a preference for cleaving next to bulky, hydrophobic 
side chains. 

How efficient can an enzyme be? We can approach this question by de- 
termining whether there are any physical limits on the value of fecat/^M- 
Note that this ratio depends onki, fe_i, and fe2, as can be shown by substi- 
tuting for Km- 


^cat 


^cat 

fe-1 + fecat/fel 


^cat 

fecat + fe-1 


ki< kl 


( 30 ) 


Suppose that the rate of formation of product (fecat) is much faster than the 
rate of dissociation of the ES complex (fe-i). The value of fecat/^^M then 
approaches fei. Thus, the ultimate limit on the value of fecat/í^M is set 
by fel, the rate of formation of the ES complex. This rate cannot be faster 
than the diffusion-controlled encounter of an enzyme and its substrate. Diffu- 
sion limits the value of fei so that it cannot be higher than between 10^ and 
10 ^ M~b Hence, the upper limit on fecat/^^M is between 10^ and 

The fecat/í^M ratios of the enzymes superoxide dismutase, acetyl- 
cholinesterase, and triosephosphate isomerase are between 10^ and 10^ s~^ 
M~b Enzymes such as these that have fecat/^M ratios at the upper limits 
have attained kinetic perfection. Their catalytic velocity is restricted only by 
the rate at which they encounter substrate in the solution (Table 8.8). Any fur- 
ther gain in catalytic rate can come only by decreasing the time for diffu- 
sion. Remember that the active site is only a small part of the total enzyme 
structure. Yet, for catalytically perfect enzymes, every encounter between 
enzyme and substrate is productive. In these cases, there may be attractive 
electrostatic forces on the enzyme that entice the substrate to the active site. 
These forces are sometimes referred to poetically as Circe effects. 

The limit imposed by the rate of diffusion in solution can also be partly 
overcome by confining substrates and products in the limited volume of a 
multienzyme complex. Indeed, some series of enzymes are associated into 
organized assemblies (Section 17 . 1 . 9 ) so that the product of one enzyme is 
very rapidly found by the next enzyme. In effect, products are channeled 
from one enzyme to the next, much as in an assembly line. 


I- 

TABLE 8.8 Enzymes for which /rcai/f^ivi ís dose to the diffusion-controlled 
rate of encounter 

Enzyme fecat/^M 


Acetylcholinesterase 

1.6 

X 

10 ' 

Carbonic anhydrase 

8.3 

X 

10 ' 

Catalase 

4 

X 

10 ' 

Crotonase 

2.8 

X 

10 ' 

Fumar ase 

1.6 

X 

10 ' 

Triose phosphate isomerase 

2.4 

X 

10 ' 

P-Lactamase 

1 

X 

10 ' 

Superoxide dismutase 

7 

X 

10 ' 


Source: After A. Fersht, Structure and Mechanism in Protein Science: A Guide 
to Enzyme Catalysis and Protein Folding (W. H. Freeman and Company, 1999), 
Table 4.5. 
















8.4.3 Most Biochemical Reactions Include Multiple Substrates 

Most reactions in biological systems usually include two substrates and two 
Products and can be represented by the bisubstrate reaction: 

A + B^^P + Q 

The majority of such reactions entail the transfer of a functional group, such 
as a phosphoryl or an ammonium group, from one substrate to the other. 
In oxidation-reduction reactions, electrons are transferred between sub¬ 
strates. Multiple substrate reactions can be divided into two classes: se- 
quential displacement and double displacement. 

Sequential Displacement. In the sequential mechanism, all substrates 
must bind to the enzyme before any product is released. Consequently, in 
a bisubstrate reaction, a ternary complex of the enzyme and both substrates 
forms. Sequential mechanisms are of two types: ordered, in which the sub¬ 
strates bind the enzyme in a defined sequence, and random. 

Many enzymes that have NAD^ or NADH as a substrate exhibit the 
sequential ordered mechanism. Consider lactate dehydrogenase, an impor- 
tant enzyme in glucose metabolism (Section 16 . 1 . 9 ). This enzyme reduces 
pyruvate to lactate while oxidizing NADH to NAD^. 




+ NADH + H+ 


Pyruvate 


HO-C-H + NAD+ 


CHj 

Lactate 


In the ordered sequential mechanism, the coenzyme always binds first and 
the lactate is always released first. This sequence can be represented as fol¬ 
io ws in a notation developed by W. Wallace Cleland: 


NADH Pyruvate Lactate NAD'^ 

Enzyme 1 _i_I_ í Enzyme 

E (NADH) (pyruvate) v E (lactate) (NADT 

The enzyme exists as a ternary complex: first, consisting of the enzyme and 
substrates and, after catalysis, the enzyme and products. 

In the random sequential mechanism, the order of addition of substrates 
and release of products is random. Sequential random reactions are illus- 
trated by the formation of phosphocreatine and ADP from ATP and crea- 
tine, a reaction catalyzed by creatine kinase (Section SO.x.x). 


NHo 


NHo 


02 - 








il 

o 


"N" 


^'NH2 + ATP 


CH3 

Creatine 


il 

O 




"N" 

H 




+ ADP 


CH3 

Phosphocreatine 


Phosphocreatine is an important energy source in muscle. Sequential ran¬ 
dom reactions can also be depicted in the Cleland notation. 


ATP Creatine 


Phosphocreatine ADP 


Enzyme 



E (creatine) 
(ATP) 


E (phosphocreatine) 
(ADP) 



Enzyme 
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Creatine ATP 


ADP Phosphocreatine 
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Although the order of certain events is random, the reaction still passes 
through the ternary complexes including, first, substrates and, then, 
Products. 

Double-Displacement (Ping-Pong) Reactions. In double-displacement, 
or Ping-Pong, reactions, one or more products are released before all sub¬ 
strates bind the enzyme. The defining feature of double-displacement re¬ 
actions is the existence of a substituted enzyme intermediate, in which the en¬ 
zyme is temporarily modified. Reactions that shuttle amino groups between 
amino acids and a-keto acids are classic examples of double-displacement 
mechanisms. The enzyme aspartate aminotransferase (Section 23 .x.x) cat- 
alyzes the transfer of an amino group from aspartate to a-ketoglutarate. 


-ooc 


£ 00 - 




-ooc 


£ 00 - 


a-Ketoglutarate 



The sequence of events can be portrayed as the following diagram. 


Aspartate 
Enzyme 1 


(aspartate) 


Oxaloacetate 


(E-NH3) 

(oxaloacetate) 


(E-NH3) 


-Ketoglutarate 


(c 


(E-NH3) 

-ketoglutarate) 


Glutamate 

í Enzyme 


(glutamate) 



FIGURE 8.14 Kinetics for an allosteric 
enzyme. Allosteric enzymes display a 
sigmoidal dependence of reaction velocity 
on substrate concentration. 


After aspartate binds to the enzyme, the enzyme removes aspartate s amino 
group to form the substituted enzyme intermediate. The first product, ox¬ 
aloacetate, subsequently departs. The second substrate, a-ketoglutarate, 
binds to the enzyme, accepts the amino group from the modified enzyme, 
and is then released as the final product, glutamate. In the Cleland nota- 
tion, the substrates appear to bounce on and off the enzyme analogously to 
a Ping-Pong bali bouncing on a table. 

8.4.4 Allosteric Enzymes Do Not Obey Michaelis-Menten Kinetics 

The Michaelis-Menten model has greatly assisted the development of en¬ 
zyme chemistry. Its virtues are simplicity and broad applicability. However, 
the Michaelis-Menten model cannot account for the kinetic properties of 
many enzymes. An important group of enzymes that do not obey Michaelis- 
Menten kinetics comprises the allosteric enzymes. These enzymes consist of 
multiple subunits and multiple active sites. 

Allosteric enzymes often display sigmoidal plots (Figure 8 . 14 ) of the re¬ 
action velocity Vo versus substrate concentration [S], rather than the hyper- 
bolic plots predicted by the Michaelis-Menten equation (equation 23 ). In al¬ 
losteric enzymes, the binding of substrate to one active site can affect the 
properties of other active sites in the same enzyme molecule. A possible out- 
come of this interaction between subunits is that the binding of substrate be- 
comes cooperative; that is, the binding of substrate to one active site of the 
enzyme facilitates substrate binding to the other active sites. As will be con- 
sidered in Chapter 10, such cooperativity results in a sigmoidal plot of Vq 
versus [S]. In addition, the activity of an allosteric enzyme may be altered by 
regulatory molecules that are reversibly bound to specific sites other than the 
catalytic sites. The catalytic properties of allosteric enzymes can thus be ad- 
justed to meet the immediate needs of a cell (Chapter 10). For this reason, 
allosteric enzymes are key regulators of metabolic pathways in the cell. 
















8.5 ENZYMES CAN BE INHIBITED 
BY SPECIFIC MOLECULES 


Enzyme Inhibition 


The activity of many enzymes can be inhibited by the binding of specific 
small molecules and ions. This means of inhibiting enzyme activity serves 
as a major control mechanism in biological systems. The regulation of al- 
losteric enzymes typifies this type of control. In addition, many drugs and 
toxic agents act by inhibiting enzymes. Inhibition by particular Chemicals 
can be a source of insight into the mechanism of enzyme action: specific in- 
hibitors can often be used to identify residues criticai for catalysis. The value 
of transition- State analogs as potent inhibitors will be discussed shortly. 

Enzyme inhibition can be either reversible or irreversible. An irreversible 
inhibitor dissociates very slowly from its target enzyme because it has be- 
come tightly bound to the enzyme, either covalently or noncovalently. Some 
irreversible inhibitors are important drugs. Penicillin acts by covalently 
modifying the enzyme transpeptidase, thereby preventing the synthesis of 
bacterial cell walls and thus killing the bactéria (Section 8 . 5 . 5 ). Aspirin acts 
by covalently modifying the enzyme cyclooxygenase, reducing the syn¬ 
thesis of inflammatory signals. 

Reversible inhibition, in contrast with irreversible inhibition, is charac- 
terized by a rapid dissociation of the enzyme—inhibitor complex. In com- 
petitive inhibition, an enzyme can bind substrate (forming an ES complex) 
or inhibitor (EI) but not both (ESI). The competitive inhibitor resembles 
the substrate and binds to the active site of the enzyme (Figure 8 . 15 ). The 
substrate is thereby prevented from binding to the same active site. A com¬ 
petitive inhibitor diminishes the rate of catalysis by reducing the proportion of 
enzyme molecules bound to a substrate. At any given inhibitor concentration, 
competitive inhibition can be relieved by increasing the substrate concen¬ 
tration. Under these conditions, the substrate ‘‘outcompetes'' the inhibitor 
for the active site. Methotrexate is a structural analog of tetrahydrofolate, 
a coenzyme for the enzyme dihydrofolate reductase, which plays a role in 
the biosynthesis of purines and pyrimidines (Figure 8 . 16 ). It binds to di¬ 
hydrofolate reductase 1000-fold more tightly than the natural substrate and 
inhibits nucleotide base synthesis. It is used to treat câncer. 



Competitive 




FIGURE 8.15 Dístínction between a 
competitive and a noncompetitive 
inhibitor. (Top) enzyme-substrate 
complex; (middie) a competitive inhibitor 
binds at the active site and thus prevents 
the substrate from binding; (bottom) a 
noncompetitive inhibitor does not prevent 
the substrate from binding. 



H 



FIGURE 8.16 Enzyme inhibitors. The 

cofactor tetrahydrofolate and its structural 
analog methotrexate. Regions with 
structural differences are shown in red. 
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S 

E ^ > ES-> E + P 




FIGURE 8.17 Kinetics of a competitive 
ínhíbítor. As the concentration of a 
competitive inhibitor increases, higher 
concentrations of substrate are required to 
attain a particular reaction velocity. The 
reaction pathway suggests how sufficientiy 
high concentrations of substrate can 
completely relieve competitive inhibition. 


S 

E + I ^ > ES -> E + P 

A A 

s I 

El — EIS 



FIGURE 8.18 Kinetics of a non- 
competitive inhibitor. The reaction 
pathway shows that the inhibitor binds 
both to free enzyme and to enzyme 
complex. Consequentiy, 1/max cannot 
be attained, even at high substrate 
concentrations. 


In noncompetitive inhibition, which also is reversible, the inhibitor and 
substrate can bind simultaneously to an enzyme molecule at different bind- 
ing sites (see Figure 8.16). A noncompetitive inhibitor acts by decreasing 
the turnover number rather than by diminishing the proportion of enzyme 
molecules that are bound to substrate. Noncompetitive inhibition, in con- 
trast with competitive inhibition, cannot be o ver come by increasing the sub¬ 
strate concentration. A more complex pattern, called mixed inhibition, is pro- 
duced when a single inhibitor both hinders the binding of substrate and 
decreases the turnover number of the enzyme. 

8.5.1 Competitive and Noncompetitive Inhibition 
Are Kinetically Distinguishable 

How can we determine whether a reversible inhibitor acts by competitive 
or noncompetitive inhibition? Let us consider only enzymes that exhibit 
Michaelis-Menten kinetics. Measurements of the rates of catalysis at dif¬ 
ferent concentrations of substrate and inhibitor serve to distinguish the two 
types of inhibition. In competitive inhibition, the inhibitor competes with the 
substrate for the active site. The dissociation constant for the inhibitor is 
given by 

K , = [eIiMei] 

Because increasing the amount of substrate can overcome the inhibition, 
V^max can be attained in the presence of a competitive inhibitor (Figure 
8.17). The hallmark of competitive inhibition is that it can be overcome by a 
sufficiently high concentration of substrate. However, the apparent value of 
Km is altered; the effect of a competitive inhibitor is to increase the 
apparent value of Km- This new value of Km, called K^^, is numerically 
equal to 

= Km(1 + [l]/KÒ 

where [I] is the concentration of inhibitor and K[ is the dissociation constant 
for the enzyme-inhibitor complex. As the value of [I] increases, the value of 
increases (see Figure 8.17). In the presence of a competitive inhibitor, 
an enzyme will have the same V^ax ^s in the absence of an inhibitor. 

In noncompetitive inhibition (Figure 8.18), substrate can still bind to the 
enzyme-inhibitor complex. However, the enzyme-inhibitor-substrate 
complex does not proceed to form product. The value of V^ax is decreased 
to a new value called while the value of Km is unchanged. Why is Vj^ax 

lowered while fCMrcmains unchanged? In essence, the inhibitor simply low- 
ers the concentration of functional enzyme. The remaining enzyme behaves 
like a more dilute solution of enzyme; Vj^ax is lower, but Km is the same. 
Noncompetitive inhibition cannot be overcome by increasing the substrate 
concentration. 

8.5.2 Irreversible Inhibitors Can Be Used to Map the Active Site 

In Chapter 9, we will examine the chemical details of how enzymes func- 
tion. The first step in obtaining the chemical mechanism of an enzyme is to 
determine what functional groups are required for enzyme activity. How 
can we ascertain these functional groups? X-ray crystallography (Section 
4.5.2) of the enzyme bound to its substrate provides one approach. Irre¬ 
versible inhibitors that covalently bond to the enzyme provide an alterna- 
tive and often complementary means for elucidating functional groups at 
the enzyme active site because they modify the functional groups, which 
can then be identified. Irreversible inhibitors can be divided into three cat- 
egories: group-specific reagents, substrate analogs, and suicide inhibitors. 























Acetylcholin- DIPF 

esterase 
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FIGURE 8.19 Enzyme inhibition by 
diisopropylphosphofiuoridate (DIPF), 
a group-specific reagent. DIPF can 
inhibit an enzyme by covalentiy modifying 
a crucial serine residue (Section 9.1.1). 


Group-specific reagents react with specific R groups of amino acids. Two 
examples of group-specific reagents are diisopropylphosphofluoridate 
(DIPF; Figure 8.19) and iodoactemide (Figure 8.20). DIPF modifies only 
1 of the 27 serine residues in the proteolytic enzyme chymotrypsin, imply- 
ing that this serine residue is especially reactive. As we will see in Chapter 
9, it is indeed the case that this serine residue is at the active site. DIPF also 
revealed a reactive serine residue in acetylcholinesterase, an enzyme im- 
portant in the transmission of nerve impulses (see Figure 8.19). Thus, DIPF 
and similar compounds that bind and inactivate acetylcholinesterase are po- 
tent nerve gases. 




Enzyme 


lodoacetamide 



+ r + H+ 


Inactivated 

enzyme 


FIGURE 8.20 Enzyme inactivation 
by lodoacetamide, a group-specific 
reagent. lodoacetamide can inactivate 
an enzyme by reacting with a criticai 
cysteine residue. 


Affiinity labeis are molecules that are structurally similar to the substrate 
for the enzyme that covalently modify active site residues. They are thus 
more specific for the enzyme active site than are group-specific reagents. 
Tosyl-L-phenylalanine chloromethyl ketone (TPCK) is a substrate analog 
for chymotrypsin (Figure 8.21). TPCK binds at the active site; and then re- 
acts irreversibly with a histidine residue at that site, inhibiting the enzyme. 
The compound 3-bromoacetol is an affinity labei for the enzyme triose 
phosphate isomerase (TIM). It mimics the normal substrate, dihydroxy- 
acetone phosphate, by binding at the active site; then it covalently modifies 
the enzyme such that the enzyme is irreversibly inhibited (Figure 8.22). 

Suicide inhibitors, or mechanism-based inhibitors are modified substrates 
that provide the most specific means to modify an enzyme active site. The 
inhibitor binds to the enzyme as a substrate and is initially processed by the 
normal catalytic mechanism. The mechanism of catalysis then generates a 
chemically reactive intermediate that inacti vates the enzyme through cova- 
lent modification. The fact that the enzyme participates in its own irre- 
versible inhibition strongly suggests that the covalently modified group on 
the enzyme is catalytically vital. One example of such an inhibitor is N,N- 
dimethylpropargylamine. A flavin prosthetic group of monoamine oxidase 







Chymotrypsin 
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FIGURE 8.21 Affiníty labelíng. 

(A) Tosyl-L-phenylalanine chloromethyl 
ketone (TPCK) is a reactive analog of 
the normal substrate for the enzyme 
chymotrypsin. (B) TPCK binds at the 
active site of chymotrypsin and modifies 
an essential histidine residue. 



Natural substrate for chymotrypsin 


Specificity group 



FIGURE 8.22 Bromoacetol phosphate, 
an affiníty labei for triose phosphate 
ísomerase (TIM). Bromoacetol phosphate, 
an analog of dihydroxyacetone phosphate, 
binds at the active site of the enzyme and 
covalentiy modifies a glutamic acid residue 
required for enzyme activity. 




Triose phosphate 
Ísomerase 
(TIM) 


Bromoacetol 

phosphate 


Inactivated 

enzyme 


FIGURE 8.23 Mechanísm-based 
(suicide) inhibition. Monoamine 
oxidase, an enzyme important for 
neurotransmitter synthesis, requires the 
cofactor FAD (flavin adenine dinucleotide). 
A/,A/-Dimethylpropargylamine inhibits 
monoamine oxidase by covalentiy 
modifying the flavin prosthetic group 
only after the inhibitor is first oxidized. 

The N-5 flavin adduct is stabilized by the 
addition of a proton. 


Flavin prosthetic group 
R 



R 



Stably modified flavin 
of inactivated enzyme 


R 





































(MAO) oxidizes the N,N-dimethylpropargylamine, which in turn inacti- 
vates the enzyme by covalently modifying the flavin prosthetic group by 
alkylating N-5 (Figure 8.23). Monoamine oxidase deaminates neurotrans- 
mitters such as dopamine and serotonin, lowering their leveis in the brain. 
Parkinson disease is associated with low leveis of dopamine, and depression 
is associated with low leveis of serotonin. The drug ( —)deprenyl, which is 
used to treat Parkinson disease and depression, is a suicide inhibitor of 
monoamine oxidase. 


8.5.3 Transition-State Analogs Are Potent Inhibitors of Enzymes 

We turn now to compounds that provide the most intimate views of the cat- 
alytic process itself. Linus Pauling proposed in 1948 that compounds re- 
sembling the transition state of a catalyzed reaction should be very effective 
inhibitors of enzymes. These mimics are called transition-state analogs. The 
inhibition of proline racemase is an instructive example. The racemization 
of proline proceeds through a transition state in which the tetrahedral a- 
carbon atom has become trigonal by loss of a proton (Figure 8.24). In the trig- 
onal form, all three bonds are in the same plane; Co^ also carries a net neg- 
ative charge. This symmetric carbanion can be reprotonated on one side to 
give the L isomer or on the other side to give the D isomer. This picture is 
supported by the finding that the inhibitor pyrrole 2-carboxylate binds to 
the racemase 160 times as tightly as does proline. The a-carbon atom of this 
inhibitor, like that of the transition state, is trigonal. An analog that also car¬ 
ries a negative charge on would be expected to bind even more tightly. 
In general, synthesizing compounds that more closely resemble the transi¬ 
tion State than the substrate itself can produce highly potent and specific 
inhibitors of enzymes. The inhibitory power of transition-state analogs un- 
derscores the essence of catalysis: selective binding of the transition state. 
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(-)Deprenyl 


(A) 


(B) 




H 
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FIGURE 8.24 Inhibition by transition state analogs. (A) The isomerization of i-proline 
to D-proline by proline racemase, a bacterial enzyme, proceeds through a planar transition 
State in which the a carbon is trigonal rather than tetrahedral. (B) Pyrrole 2-carboxylate, a 
transition state analog because of its trigonal geometry, is a potent inhibitor of proline 
racemase. 


8.5.4 Catalytic Antibodies Demonstrate the Importance 
of Selective Binding of the Transition State to Enzymatic Activity 

Antibodies that recognize transition States should function as catalysts, if 
our understanding of the importance of the transition state to catalysis is 
correct. The preparation of an antibody that catalyzes the insertion of a metal 
ion into a porphyrin nicely illustrates the validity of this approach. Fer- 
rochelatase, the final enzyme in the biosynthetic pathway for the produc- 
tion of heme, catalyzes the insertion of Fe^^ into protoporphyrin IX. The 
nearly planar porphyrin must be bent for iron to enter. The recently deter- 
mined crystal structure of the ferrochelatase bound to a substrate analog 
confirms that the enzyme does indeed bend one of the pyrole rings, dis- 
torting it 36 degrees to insert the iron. 
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FIGURE 8.25 Use of transítíon-state 
analogs to generate catalytic 
antíbodies. The insertion of a metal ion 
into a porphyrin by ferrochelatase proceeds 
through a transition State in which the 
porphyrin is bent. AZ-Methylmesoporphyrin, 
a bent porphyrin that resembles the 
transition State of the ferrochelatase- 
catalyzed reaction, was used to generate 
an antibody that aiso catalyzes the insertion 
of a metal ion into a porphyrin ring. 


The problem was to find a transition-state analog for this metallation 
reaction that could be used as an antigen (immunogen) to generate an 
antibody. The solution carne from the results of studies showing that an 
alkylated porphyrin, N-methylprotoporphyrin, is a potent inhibitor of 
ferrochelatase. This compound resembles the transition state because 
N-alkylation forces the porphyrin to be bent. Moreover, it was known that N- 
alkylporphyrins chelate metal ions times as fast as their unalkylated 
counterparts do. Bending increases the exposure of the pyrrole nitrogen lone 
pairs of electrons to solvent, which facilitates metal in binding. 

An antibody catalyst was produced with the use of an N-alkylporphyrin 
as the immunogen. The resulting antibody presumably distorts a planar 
porphyrin (Figure 8.25) to facilitate the entry of a metal. On average, an an¬ 
tibody molecule metallated 80 porphyrin molecules per hour, a rate only 
10-fold less than that of ferrochelatase and 2500-fold faster than the uncat- 
alyzed reaction. Catalytic antibodies (abzymes) can indeed be produced by us- 
ing transition-State analogs as antigens. Antibodies catalyzing many other 
kinds of Chemical reactions—exemplified by ester and amide hydrolysis, 
amide-bond formation, transesterification, photoinduced cleavage, pho- 
toinduced dimerization, decarboxylation, and oxidization —have been pro¬ 
duced with the use of similar strategies. The results of studies with transi¬ 
tion-state analogs provide strong evidence that enzymes can function 
complementary in structure to the transition state. The power of transition- 
State analogs is now evident: (1) they are sources of insight into catalytic mech- 
anisms, (2) they can serve as potent and specific inhibitors of enzymes, and 
(3) they can be used as immunogens to generate a wide range of novel catalysts. 

8.5.5 Penicillin Irreversibly Inactivates a Key Enzyme 
in Bacterial Cell-Wall Synthesis 

T Penicillin, the first antibiotic discovered, consists of a thiazolidine ring 

... . ^ 
fused to a [3 -lactam ring, to which a variable R group is attached by a 

peptide bond (Figure 8.26A). In benzyl penicillin, for example, R is a benzyl 
group (Figure 8.26B). This structure can undergo a variety of rearrangements, 
and, in particular, the |3 -lactam ring is very labile. Indeed, this instability is 
closely tied to the antibiotic action of penicillin, as will be evident shortly. 

How does penicillin inhibit bacterial growth? In 1957, Joshua Lederberg 
showed that bactéria ordinarily susceptible to penicillin could be grown in 
its presence if a hypertonic médium were used. The organisms obtained in 
this way, called protoplasts, are devoid of a cell wall and consequently lyse 
when transferred to a normal médium. Hence, penicillin was inferred to in¬ 
terfere with the synthesis of the bacterial cell wall. The cell-wall macro- 
molecule, called ãpeptidoglycan, consists of linear polysaccharide chains that 



FIGURE 8.26 Structure of penicillin. The reactive site of penicillin is the peptide bond of 
its p-lactam ring. (A) Structural formula of penicillin. (B) Representation of benzyl penicillin. 





are cross-linked by short peptides (Figure 8.27). The enormous bag-shaped 
peptidoglycan confers mechanical support and prevents bactéria from 
bursting in response to their high internai osmotic pressure. 

In 1965, James Park and Jack Strominger independently deduced that 
penicillin blocks the last step in cell-wall synthesis—namely, the cross- 
linking of different peptidoglycan strands. In the formation of the cell wall 
of Staphylococcus aureus, the amino group at one end of a pentaglycine chain 
attacks the peptide bond between two D-alanine residues in another peptide 
unit (Figure 8.28). A peptide bond is formed between glycine and one of 
the D-alanine residues; the other D-alanine residue is released. This cross- 
linking reaction is catalyzed by glycopeptide transpeptidase. Bacterial cell 
walls are unique in containing D amino acids, which form cross-links by a 
mechanism different from that used to synthesize proteins. 

Penicillin inhibits the cross-linking transpeptidase by the Trojan horse 
stratagem. The transpeptidase normally forms an acyl intermediate with the 
penultimate D-alanine residue of the D-Ala-D-Ala peptide (Figure 8.29). 
This covalent acyl-enzyme intermediate then reacts with the amino group 
of the terminal glycine in another peptide to form the cross-link. Penicillin 
is welcomed into the active site of the transpeptidase because it mimics the 
D-Ala-D-Ala moiety of the normal substrate (Figure 8.30). Bound penicillin 
then forms a covalent bond with a serine residue at the active site of the en- 
zyme. This penicilloyl-enzyme does not react further. Hence, the transpepti¬ 
dase is irreversibly inhibited and cell-wall synthesis cannot take place. 



FIGURE 8.27 Schematíc representatíon 
of the peptidoglycan in Staphylococcus 
aureus. The sugars are shown in yellow, the 
tetrapeptides in red, and the pentaglycine 
bridges in blue. The cell wall is a single, 
enormous, bag-shaped macromolecule 
because of extensive cross-linking. 



Terminal glycine Terminal o-Ala-o-Ala unit Gly-o-Ala crossiink 

residue of 
pentaglycine 
bridge 



FIGURE 8.28 Formation of cross-links in 5. aureus peptidoglycan. The terminal 
amino group of the pentaglycine bridge in the cell wall attacks the peptide bond between 
two D-alanine residues to form a cross-link. 
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FIGURE 8.29 Transpeptidation reaction. An acyl-enzyme intermediate is formed in the 
transpeptidation reaction leading to cross-link formation. 




FIGURE 8.30 Conformations of 
penicillin and a normal substrate. 

The conformation of penicillin in the 
vicinity of its reactive peptide bond 

(A) resembles the postulated conformation 
of the transition State of R-o-Ala-o-Ala 

(B) in the transpeptidation reaction. [After 
B. Lee. J. Mol. Biol. 61 (1971);464.] 


Penicillin 
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FIGURE 8.31 Formation of a 
penicilloyl-enzyme complex. Penicillin 
reacts with the transpeptidase to form 
an inactive complex, which is indefinitely 
stable. 



Glycopeptide 

transpeptidase 



Penicilloyl-enzyme complex 

(enzymatically inactive) 


Why is penicillin such an effective inhibitor of the transpeptidase? The 
highly strained, four-membered |3 -lactam ring of penicillin makes it 
especially reactive. On binding to the transpeptidase, the serine residue at 
the active site attacks the carbonyl carbon atom of the lactam ring to form 
the penicilloyl-serine derivative (Figure 8.31). Because the peptidase par- 
ticipates in its own inactivation, penicillin acts as a suicide inhibitor. 


8.6 VITAMINS ARE OFTEN PRECURSORS TO COENZYMES 


T Earlier (Section 8.1.1), we considered the fact that many enzymes re- 
quire cofactors to be catalytically active. One class of these cofactors, 
termed coenzymes, consists of small organic molecules, many of which are 
derived from vitamins. Vitamins themselves are organic molecules that are 
needed in small amounts in the diets of some higher animais. These mole- 
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TABLE 8.9 Water-Soluble Vitamins 


Vitamin 

Coenzyme 

Typical reaction type 

Consequences of deficiency 

Thiamine (Bi) 

Thiamine pyrophosphate 

Aldehyde transfer 

Beriberi (weight loss, 
heart problems, 
neurological 
dysfunction) 

Riboflavin (B 2 ) 

Flavin adenine 
dinucleotide (FAD) 

Oxidation-reduction 

Cheliosis and angular 
stomatitus (lesions of the 
mouth), dermatitis 

Pyridoxine (Bó) 

Pyridoxal phosphate 

Group transfer to or from 
amino acids 

Depression, confusion, 
convulsions 

Nicotinic acid (niacin) 

Nicotinamide adenine 
dinucleotide (NAD"^) 

Oxidation-reduction 

Pellagra (dermatitis, 
depression, diarrhea) 

Pantothenic acid 

Coenzyme A 

Acyl-group transfer 

Hypertension 

Biotin 

Biotin-lysine complexes 

ATP - dependent 

Rash about the 


(biocytin) 

carboxylation and 
carboxyl-group transfer 

eyebrows, muscle pain, 
fatigue (rare) 

Folie acid 

Tetrahydrofolate 

Transfer of one-carbon 
components; thymine 
synthesis 

Anemia, neural-tube 
defects in development 

Bi2 

5' -Deoxyadenosyl cobalamin 

Transfer of methyl 
groups; intramolecular 
rearrangements 

Anemia, pernicious 
anemia, methylmalonic 
acidosis 

C (ascorbic acid) 


Antioxidant 

Scurvy (swollen and 
bleeding gums, 
subdermal hemorrhages) 
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TABLE 8.10 Fat-soluble vitamins 


Vitamin 

Function 

Deficiency 

A 

Antioxidant 

Inhibition of sperm production; 
lesions in muscles and nerves (rare) 

D 

Regulation of calcium and 
phosphate metabolism 

Rickets (children): skeletal 
deformaties, impaired growth 
Osteomalacia (adults): soft, 
bending bones 

E 

Roles in vision, growth, 
reproduction 

Night blindness, córnea 
damage, damage to respiratory 
and gastrointestinal tract 

K 

Blood coagulation 

Subdermal hemorrhaging 
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cules serve the same roles in nearly all forms of life, but higher animais lost 
the capacity to synthesize them in the course of evolution. For instance, 
whereas E. coli can thrive on glucose and organic salts, human beings re- 
quire at least 12 vitamins in the diet. The biosynthetic pathways for vita¬ 
mins can be complex; thus, it is biologically more efficient to ingest vita¬ 
mins than to synthesize the enzymes required to construct them from simple 
molecules. This efficiency comes at the cost of dependence on other organ- 
isms for Chemicals essential for life. Indeed, vitamin deficiency can gener- 
ate diseases in all organisms requiring these molecules (Tables 8.9 and 8.10). 
Vitamins can be grouped according to whether they are soluble in water or 
in nonpolar solvents. 

8.6.1 Water-Soluble Vitamins Function As Coenzymes 

T Table 8.9 lists the water-soluble vitamins —ascorbic acid (vitamin C) 
and a series known as the vitamin B complex (Figure 8.32). Ascor- 
bate, the ionized form of ascorbic acid, serves as a reducing agent (an an- 
tioxidant), as will be discussed shortly. The vitamin B series comprises com- 
ponents of coenzymes. Note that, in all cases except vitamin C, the vitamin 
must be modified before it can serve its function. 

Vitamin deficiencies are capable of causing a variety of pathological con- 
ditions (see Table 8.9). However, many of the same symptoms can result 
from conditions other than lack of a vitamin. For this reason and because 
vitamins are required in relatively small amounts, pathological conditions 
resulting from vitamin deficiencies are often difficult to diagnose. 

The requirement for vitamin C proved relatively straightforward to 
demonstrate. This water-soluble vitamin is not used as a coenzyme but is 



FIGURE 8.32 Structures of some 
water-soluble vitamins. 
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(Ribofiavin) 



Vitamin B3 
(Niacin) 


Vitamin Be 
(Pyridoxine) 















FIGURE 8.33 Formation of 4-hydroxyproline. Prolíne is hydroxylated at C-4 by the 

action of prolyl hydroxylase, an enzyme that activates molecular oxygen. 





FIGURE 8.34 Forms of ascorbic acíd 
(vitamin C). Ascorbate is the ionized 
form of vitamin C, and dehydroascorbic 
acid is the oxidized form of ascorbate. 


still required for the continued activity of proyl hydroxylase. This enzyme 
synthesizes 4-hydroxyproline, an amino acid that is required in collagen, 
the major connective tissue in vertebrates, but is rarely found anywhere else. 
How is this unusual amino acid formed and what is its role? The results of 
radioactive-labeling studies showed that proline residues on the amino side 
of glycine residues in nascent collagen chains become hydroxylated. The 
oxygen atom that becomes attached to C-4 of proline comes from molecu¬ 
lar oxygen, O 2 . The other oxygen atom of O 2 is taken up by n-ketoglutarate, 
which is converted into succinate (Figure 8.33). This complex reaction is 
catalyzed by prolyl hydroxylase, a dioxygenase. It is assisted by an Fe^^ ion, 
which is tightly bound to it and needed to activate O 2 . The enzyme also 
converts a-ketoglutarate into succinate without hydroxylating proline. In 
this partial reaction, an oxidized iron complex is formed, which inactivates 
the enzyme. How is the active enzyme regenerated? Ascorbate (vitamin C) 
comes to the rescue by reducing the ferric ion of the inactivated enzyme. In 
the recovery process, ascorbate is oxidized to dehydroascorbic acid (Figure 
8.34). Thus, ascorbate serves here as a specific antioxidant. 

Primates are unable to synthesize ascorbic acid and hence must acquire 
it from their diets. The importance of ascorbate becomes strikingly evident 
in scurvy. Jacques Cartier in 1536 gave a vivid description of this dietary de- 
ficiency disease, which afflicted his men as they were exploring the Saint 
Lawrence Ri ver: 

Some did lose all their strength, and could not stand on their feet. ... 
Others also had all their skins spotted with spots of blood of a purple 
colour: then did it ascend up to their ankles, knees, thighs, shoulders, arms, 
and necks. Their mouths became stinking, their gums so rotten, that all 
the flesh did fali off, even to the roots of the teeth, which did also almost 
all fali out. 

James Lind, a Scottish physician, illuminated the means of preventing 
scurvy in an article titled ‘‘A Treatise of the Scurvy'' published in 1747. Lind 
described a controlled study establishing that scurvy could be prevented by 
including citrus fruits in the diet. The Royal Navy eventually began issu- 
ing lime rations to sailors, from which custom British sailors acquired the 
nickname ‘dimeys." Lind's research was inspired by the plight of an expe- 
dition commanded by Commodore George Anson. Anson left England in 
1740 with a fleet of six ships and more than 1000 men and returned with 
an enormous amount of treasure, but of his crew only 145 survived to reach 
home. The remainder had died of scurvy. 

Why does impaired hydroxylation have such devastating consequences? 
Collagen synthesized in the absence of ascorbate is less stable than the normal 
protein. Studies of the thermal stability of synthetic polypeptides have been 









Summary 


especially informative. Hydroxyproline stabilizes the collagen triple helix 
by forming interstrand hydrogen bonds. The abnormal fibers formed by in- 
sufficiently hydroxylated collagen contribute to the skin lesions and blood- 
vessel fragility seen in scurvy. 

8.6.2 Fat-Soluble Vitamins Participate in Diverse Processes 
Such as Blood Clotting and Vision 

T Not all vitamins function as coenzymes. The fat-soluble vitamins, 
which are designated by the letters A, D, E, and K (Figure 8.35, Table 
8 .10), have a diverse array of functions. Vitamin K, which is required for 
normal blood clotting {K from the German feoagulation), participates in the 
carboxylation of glutamate residues to y-carboxyglutamate, which makes 
modified glutamic acid a much stronger chelator of Ca^^ (Section 10.7.7). 
Vitamin A (retinol) is the precursor of retinal, the light-sensitive group in 
rhodopsin and other visual pigments (Section 32.3.1). A deficiency of this 
vitamin leads to night blindness. In addition, young animais require vita¬ 
min A for growth. Retinoic acid, which contains a terminal carboxylate in 
place of the alcohol terminus of retinol, serves as a signal molecule and 
activates the transcription of specific genes that mediate growth and devei- 
opment (Section 31.x.x). A metabolite of vitamin D is a hormone that reg- 
ulates the metabolism of calcium and phosphorus. A deficiency in vitamin 
D impairs bone formation in growing animais. Infertility in rats is a con- 
sequence of vitamin E (a-tocopherol) deficiency. This vitamin reacts with 
and neutralizes reactive oxygen species such as hydroxyl, radicais before 
they can oxidize unsaturated membrane lipids, damaging cell structures. 





SUMMARY 


Enzymes are PowerfuI and Highly Specific Catalysts 

The catalysts in biological systems are enzymes, and nearly all enzymes 
are proteins. Enzymes are highly specific and have great catalytic power. 
They can enhance reaction rates by factors of 10^ or more. Many en¬ 
zymes require cofactors for activity. Such cofactors can be metal ions or 
small, vitamin-derived organic molecules called coenzymes. 
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^ Free Energy Is a UsefuI Thermodynamic Function 
for Understanding Enzymes 

Free energy (G) is the most valuable thermodynamic function for de- 
termining whether a reaction can take place and for understanding the 
energetics of catalysis. A reaction can occur spontaneously only if the 
change in free energy (AG) is negative. The free-energy change of a re¬ 
action that takes place when reactants and products are at unit activity 
is called the standard free-energy change (AG°). Biochemists usually 
use AG°the standard free-energy change at pH 7. Enzymes do not al¬ 
ter reaction equilibria; rather, they increase reaction rates. 

• Enzymes Accelerate Reactions by Facilitating the Formation of the 
Transition State 

Enzymes serve as catalysts by decreasing the free energy of activation 
of Chemical reactions. Enzymes accelerate reactions by providing a re¬ 
action pathway in which the transition state (the highest-energy species) 
has a lower free energy and hence is more rapidly formed than in the 
uncatalyzed reaction. 

The first step in catalysis is the formation of an enzyme-substrate 
complex. Substrates are bound to enzymes at active-site clefts from 
which water is largely excluded when the substrate is bound. The speci- 
ficity of enzyme-substrate interactions arises mainly from hydrogen 
bonding, which is directional, and the shape of the active site, which re- 
jects molecules that do not have a sufficiently complementary shape. 
The recognition of substrates by enzymes is accompanied by confor- 
mational changes at active sites, and such changes facilitate the forma¬ 
tion of the transition state. 


• The Michaelis-Menten Model Accounts for the Kinetic Properties 
of Many Enzymes 

The Michaelis-Menten model accounts for the kinetic properties of 
some enzymes. In this model, an enzyme (E) combines with a substrate 
(S) to form an enzyme-substrate (ES) complex, which can proceed to 
form a product (P) or to dissociate into E and S. 

E + S A±ES^E + P 

The rate Vq of formation of product is given by the Michaelis-Menten 
equation: 


Vo = V, 


[s] 

[s] + Ku 


in which Vmax is the reaction rate when the enzyme is fully saturated 
with substrate and K^, the Michaelis constant, is the substrate con- 
centration at which the reaction rate is half maximal. The maximal rate, 
Vmax> is equal to the product of k 2 or fe^at and the total concentration of 
enzyme. The kinetic constant fe^at, called the turnover number, is 
the number of substrate molecules converted into product per unit time 
at a single catalytic site when the enzyme is fully saturated with sub¬ 
strate. Turnover numbers for most enzymes are between 1 and per 
second. The ratio of fecat/^^M provides a penetrating probe into enzyme 
efficiency. 

Allosteric enzymes constitute an important class of enzymes whose 
catalytic activity can be regulated. These enzymes, which do not con- 
form to Michaelis-Menton kinetics, have multiple active sites. These 
active sites display cooperativity, as evidenced by a sigmoidal depen- 
dence of reaction velocity on substrate concentration. 







• Enzymes Can Be Inhibited by Specific Molecules I 221 | 

Specific small molecules or ions can inhibit even nonallosteric en- Appendix 

zymes. In irreversible inhibition, the inhibitor is covalently linked to 
the enzyme or bound so tightly that its dissociation from the enzyme 
is very slow. Covalent inhibitors provide a means of mapping the en- 
zyme’s active site. In contrast, reversible inhibition is characterized by 
a rapid equilibrium between enzyme and inhibitor. A competitive in¬ 
hibitor prevents the substrate from binding to the active site. It re- 
duces the reaction velocity by diminishing the proportion of enzyme 
molecules that are bound to substrate. In noncompetitive inhibition, 
the inhibitor decreases the turno ver number. Competitive inhibition 
can be distinguished from noncompetitive inhibition by determining 
whether the inhibition can be overcome by raising the substrate 
concentration. 

The essence of catalysis is selective stabilization of the transition 
State. Hence, an enzyme binds the transition state more tightly than the 
substrate. Transition-state analogs are stable compounds that mimic 
key features of this highest-energy species. They are potent and specific 
inhibitors of enzymes. Proof that transition-state stabilization is a key 
aspect of enzyme activity comes from the generation of catalytic anti- 
bodies. Transition-state analogs are used as antigens, or immunogens, 
in generating catalytic antibodies. 

• Vitamins Are Often Precursors to Coenzymes 

Vitamins are small biomolecules that are needed in small amounts in 
the diets of higher animais. The water-soluble vitamins are vitamin C 
(ascorbate, an antioxidant) and the vitamin B complex (components of 
coenzymes). Ascorbate is required for the hydroxylation of proline 
residues in collagen, a key protein of connective tissue. The fat-soluble 
vitamins are vitamin A (a precursor of retinal), D (a regulator of cal- 
cium and phosphorus metabolism), E (an antioxidant in membranes), 
and K (a participant in the carboxylation of glutamate). 


APPENDIX; 1/^ax AND /Cm CAN BE DETERMINED 
BY DOUBLE-RECIPROCAL PLOTS 


Before the availability of computers, the determination of 
and Vmax values required algebraic manipulation of the Basic 
Michaelis-Menten equation. Because V^ax is approached 
asymptotically (see Figure 8.11), it is impossible to obtain a de- 
finitive value from a typicai Michaelis-Menten plot. Because 
is the concentration of substrate at V^sl^/2, it is likewise 
impossible to determine an accurate value of Km- However, V^ax 
can be accurately determined if the Michaelis-Menten equation 
is transformed into one that gives a straight-line plot. Taking the 
reciprocai of both sides of equation 23 gives 


J_ — ^ ^ J_ 

Vo ” ■ [s] 


(31) 



A plot of 1/Vo versus 1/[S], called a Lineweaver-Burk or double- 
reciprocal plot, yields a straight line with an intercept of l/V^ax 
and a slope of K^/Vnia^ (Figure 8.36). The intercept on the y- 
axis is — 1 /Km- 


FIGURE 8.36 A double-reciprocal or Lineweaver-Burk plot. 

A double-reciprocal plot of enzyme kinetics is generated by plotting 
l/l/o as a function 1 /[S]. The slope is the ZCivi/l/max/ the intercept on 
the vertical axis is l/l/max, and the intercept on the horizontal axis 
is — l//C|vi. 
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FIGURE 8.37 Competitive inhibition illustrated on a double- 
reciprocai plot. A double-reciprocal plot of enzyme kinetics in the 
presence ( - - ■ • ) and absence of a competitive inhibitor 

illustrates that the inhibitor has no effect on 1/max but increases Km. 


FIGURE 8.38 Noncompetítíve inhibition illustrated on a 
double-reciprocal plot. A double-reciprocal plot of enzyme kinetics 
in the presence ( ) and absence of a noncompetitive 

inhibitor shows that Km is unaltered and 1/max is decreased. 


Double-reciprocal plots are especially useful for distinguish- 
ing between competitive and noncompetitive inhibitors. In com¬ 
petitive inhibition, the intercept on the y-axis of the plot of 1/Vq 
versus 1/[S] is the same in the presence and in the absence of in¬ 
hibitor, although the slope is increased (Figure 8.37). That the 
intercept is unchanged is because a competitive inhibitor does not 
alter V^ax- At a sufficiently high concentration, virtually all the 
active sites are filled by substrate, and the enzyme is fully opera- 
tive. The increase in the slope of the 1 /Vq versus 1/[S] plot indi- 
cates the strength of binding of competitive inhibitor. In the pres¬ 
ence of a competitive inhibitor, equation 31 is replaced by 


1 _ 1 

Vo “ V_ 


Km Í 

VmaxV 


1 



in which [I] is the concentration of inhibitor and is the disso- 
ciation constant of the enzyme-inhibitor complex. 

E + I^^EI 

K, = 



In other words, the slope of the plot is increased by the factor 
(1 + [l]/Ki) in the presence of a competitive inhibitor. Consider 
an enzyme with a of M. In the absence of inhibitor, Vq 
— Vmax/2 when [S] = M. In the presence of 2 X M 
competitive inhibitor that is bound to the enzyme with a of 
M, the apparent will be equal to Km X (1 + 

[I]/Ki), or 3 X M. Substitution of these values into equa¬ 
tion 23 gives Vo = Vmax/4, when [S] = M. The presence 
of the competitive inhibitor thus cuts the reaction rate in half at 
this substrate concentration. 

In noncompetitive inhibition (Figure 8.38), the inhibitor 
can combine with either the enzyme or the enzyme—substrate 
complex. In pure noncompetitive inhibition, the values of the 
dissociation constants of the inhibitor and enzyme and of the in¬ 
hibitor and enzyme-substrate complex are equal (Section 8.5.1). 
The value of V^ax is decreased to a new value called and 

so the intercept on the vertical axis is increased. The new slope, 
which is equal to Km/ is larger by the same factor. In con- 
trast with Vj^ax» is not affected by pure noncompetitive in¬ 
hibition. The maximal velocity in the presence of a pure non¬ 
competitive inhibitor, Vlnax> is given by 


1 + [I]/K. 
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enzyme (p. 189) 
substrate (p. 190) 
cofactor (p. 191) 
apoenzyme (p. 191) 
holoenzyme (p. 191) 
coenzyme (p. 192) 
prosthetic group (p. 192) 
free energy (p. 193) 
transition State (p. 196) 
free energy of activation 
(p. 196) 

active site (p. 197) 


induced fit (p. 200) 

Km (the Michaelis constant) 
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V^ax(p. 203) 

Michaelis-Menten equation 
(p. 203) 

turno ver number (p. 204) 
fecat/i^M (p. 205) 
sequential displacement reaction 
(p. 207) 

double-displacement (Ping-Pong) 
reaction (p. 207) 


allosteric enzyme (p. 208) 
competitive inhibition (p. 209) 
noncompetitive inhibition (p. 210) 
group-specific reagent (p. 211) 
affinity labei (p. 211) 
mechanism-based (suicide) 
inhibition (p. 211) 
transition-State analog (p. 213) 
catalytic antibody (abzyme) (p. 214) 
vitamin (p. 216) 
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-1 PROBLEMS I- 

1. Hydrolytic driving force. The hydrolysis of pyrophosphate to 
orthophosphate is important in driving forward biosynthetic re- 
actions such as the synthesis of DNA. This hydrolytic reaction 
is catalyzed in Escherichia coli by a pyrophosphatase that has a 
mass of 120 kd and consists of six identical subunits. For this 
enzyme, a unit of activity is defined as the amount of enzyme 
that hydrolyzes 10 pmol of pyrophosphate in 15 minutes at 37 ° C 
under standard assay conditions. The purified enzyme has a 
^max of 2800 units per milligram of enzyme. 

(a) How many moles of substrate are hydrolyzed per second per 
milligram of enzyme when the substrate concentration is much 
greater than 

(b) How many moles of active site are there in 1 mg of enzyme? 
Assume that each subunit has one active site. 

(c) What is the turnover number of the enzyme? Compare this 
value with others mentioned in this chapter. 

2. Destroying the Trojan horse. Penicillin is hydrolyzed and 
thereby rendered inactive by penicillinase (also known as P-lac 


tamase), an enzyme present in some resistant bactéria. The mass 
of this enzyme in Staphylococcus aureus is 29.6 kd. The amount 
of penicillin hydrolyzed in 1 minute in a 10-ml solution contain- 
ing 10~^ g of purified penicillinase was measured as a function 
of the concentration of penicillin. Assume that the concentration 
of penicillin does not change appreciably during the assay. 


[Penicillin] 

(fiiM) 

Amount hydrolyzed 
(nanomoles) 

1 

0.11 

3 

0.25 

5 

0.34 

10 

0.45 

30 

0.58 

50 

0.61 
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(a) Plot Vo versus [S] and 1 / Vq versus 1 / [S] for these data. Does 
penicillinase appear to obey Michaelis-Menten kinetics? If so, 
what is the value of Km? 

(b) What is the value of V^ax? 

(c) What is the turnover number of penicillinase under these 
experimental conditions? Assume one active site per enzyme 
molecule. 

3. Counterpoint. Penicillinase (|3-lactamase) hydrolyzes peni- 
cillin. Compare penicillinase with glycopeptide transpeptidase. 

4. Mode of inhibition. The kinetics of an enzyme are measured 
as a function of substrate concentration in the presence and in 
the absence of 2 mM inhibitor (I). 


kinetic data is to plot Vq versus Vo/[S], which is known as an 
Eadie-Hofstee plot. 

(a) Rearrange the Michaelis-Menten equation to give Vq as a 
function of Vo/[S]. 

(b) What is the significance of the slope, the vertical intercept, 
and the horizontal intercept in a plot of Vq versus Vo/[S]? 

(c) Sketch a plot of Vq versus Vo/[S] in the absence of an in¬ 
hibitor, in the presence of a competitive inhibitor, and in the 
presence of a noncompetitive inhibitor. 

7. Potential donors and acceptors. The hormone progesterone 
contains two ketone groups. At pH 7, which side chains of the 
receptor might form hydrogen bonds with progesterone? 


Velocity (jULinol/minute) 



No inhibitor 

Inhibitor 

3 

10.4 

4.1 

5 

14.5 

6.4 

10 

22.5 

11.3 

30 

33.8 

22.6 

90 

40.5 

33.8 


(a) What are the values of V^ax and Km iia the absence of in¬ 
hibitor? In its presence? 

(b) What type of inhibition is it? 

(c) What is the binding constant of this inhibitor? 

(d) If [S] = 10 fxM and [I] = 2 mM, what fraction of the en¬ 
zyme molecules have a bound substrate? A bound inhibitor? 

(e) If [S] = 30 |jlM, what fraction of the enzyme molecules have 
a bound substrate in the presence and in the absence of 2 mM 
inhibitor? Compare this ratio with the ratio of the reaction ve- 
locities under the same conditions. 


8. Competing substrates. Suppose that two substrates, A and B, 

compete for an enzyme. Derive an expression relating the ratio 
of the rates of utilization of A and B, Va/Vb, to the concentra- 
tions of these substrates and their values of ^2 ^m- (Hint: 

Express Va as a function of k 2 /K^ for substrate A, and do the 
same for Vb.) Is specificity determined by Km alone? 

9. A tenacious mutant. Suppose that a mutant enzyme binds a 
substrate 100-fold as tightly as does the native enzyme. What is 
the effect of this mutation on catalytic rate if the binding of the 
transition State is unaffected? 

10. Uncompetitive inhibition. The following reaction represents 
the mechanism of action of an uncompetitive inhibitor. 

h k2 

E + S ES E + P 

fe_i + 

I 

kl 


5. A different mode. The kinetics of the enzyme considered in 
problem 4 are measured in the presence of a different inhibitor. 
The concentration of this inhibitor is 100 |jlM. 

(a) What are the values of Vj^ax and Km in the presence of this 
inhibitor? Compare them with those obtained in problem 4. 

(b) What type of inhibition is it? 

(c) What is the dissociation constant of this inhibitor? 

pç,, Velocity (|ULmol/minute) 


(|ulM) No inhibitor Inhibitor 


3 

10.4 

2.1 

5 

14.5 

2.9 

10 

22.5 

4.5 

30 

33.8 

6.8 

90 

40.5 

8.1 


(d) If [S] = 30 [xM, what fraction of the enzyme molecules have 
a bound substrate in the presence and in the absence of 100 |jlM 
inhibitor? 

6. A fresh view. The plot of I/Vq versus 1/[S] is sometimes 
called a Lineweaver-Burk plot. Another way of expressing the 


ESI 

(a) Draw a standard Michaelis-Menton curve in the absence and 
in the presence of increasing amounts of inhibitor. Repeat for a 
double-reciprocai plot. 

(b) Explain the results obtained in part a. 

11. More Michaelis-Menten. For an enzyme that follows simple 
Michaelis-Menten kinetics, what is the value of V^ax if Vq is 
equal to 1 pumol/ minute at 1/10 Km? 

Data Interpretation Problems 

12. Varying the enzyme. For a one-substrate, enzyme-catalyzed 
reaction, double-reciprocai plots were determined for three dif¬ 
ferent enzyme concentrations. Which of the following three fam- 
ilies of curve would you expect to be obtained? Explain. 
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13. Too much of a good thing. A simple Michaelis-Menten 
enzyme, in the absence of any inhibitor, displayed the following 


kinetic behavior. The expected value of 
y-axis. 


is shown on the 


o 

't; 



(a) Draw a double-reciprocai plot that corresponds to the 
velocity-versus-substrate curve. 

(b) Provide an explanation for the kinetic results. 

14. Rate-limiting step. In the conversion of A into D in the fol¬ 
lowing biochemical pathway, enzymes Ea, Eb, and Ec have the 
Km values indicated under each enzyme. If all of the substr ates 
and Products are present at a concentration of 10~^ M, which 
step will be rate limiting and why? 


A 


B 


C 


D 




10“^M 


10“^M 


10“^M 


Chapter Integration Problems 

15. Titration experiment. The effect of pH on the activity of an 
enzyme was examined. At its active site, the enzyme has an ion- 
izable group that must be negatively charged for substrate bind- 
ing and catalysis to take place. The ionizable group has a pKa of 
6.0. The substrate is positively charged throughout the pH range 
of the experiment. 


E- 

-h 

H+ 


+ S+ 


E-^S+ 


E- + 


(a) Draw the Vq- versus-pH curve when the substrate concen¬ 
tration is much greater than the enzyme Km- 

(b) Draw the Vo-versus-pH curve when the substrate concen¬ 
tration is much less than the enzyme Km- 

(c) At which pH will the velocity equal one-half of the maximal 
velocity attainable under these condition? 

16. A question of stability. Pyridoxal phosphate (PLP) is a coen- 
zyme for the enzyme ornithine aminotransferase. The enzyme 
was purified from cells grown in PLP-deficient media as well as 
from cells grown in media that contained pyridoxal phosphate. 
The stability of the enzyme was then measured by incubating 
the enzyme at 37°C and assaying for the amount of enzyme ac¬ 
tivity remaining. The following results were obtained. 


II 

lo M 

- (O 





(a) Why does the amount of active enzyme decrease with the 
time of incubation? 

(b) Why does the amount of enzyme from the PLP-deficient 
cells decline more rapidly? 




Media Problem 


17. Not all data points are created equal Your lab partner, who 
is both systematic and frugal, decides to perform a series of en¬ 
zyme assays at substrate concentrations of 1, 2, 4, and 8 pM. 
You argue for doing the experiments at [S] = 1, 4, 16, and 
100 |jlM. Try both sets of experiments using the simulated en¬ 
zyme kinetics lab in the Steady-State Enzyme Kinetics Con¬ 
ceptual Insights module. Who had the better idea, and why? 


EH 





















Catalytic Strategies 


Strategy and tactics. Chess and enzymes have in common the 
use of strategy, consciousiy thought out in the game of chess and 
selected by evolution for the action of an enzyme. The three amino 
add residues at the right, denoted by the white bonds, constitute a 
catalytic triad found in the active site of a class of enzymes that 
cleave peptide bonds. The substrate, represented by the molecule 
with black bonds, is as hopelessiy trapped as the king in the pho- 
tograph of a chess match at the left and is sure to be cleaved. 
[Photograph courtesy of Wendie Berg.] 


OUTLINE 


What are the sources of the catalytic power and specificity of enzymes? This 
chapter presents the catalytic strategies used by four classes of enzymes: the 
serine proteases, carbonic anhydrases, restriction endonucleases, and nucleo- 
side monophosphate (NMP) kinases. The first three classes of enzymes cat- 
alyze reactions that require the addition of water to a substrate. For the ser¬ 
ine proteases, exemplified by chymotrypsin, the challenge is to promote a 
reaction that is almost immeasurably slow at neutral pH in the absence of a 
catalyst. For carbonic anhydrases, the challenge is to achieve a high absolute 
rate of reaction, suitable for integration with other rapid physiological 
processes. For restriction endonucleases such as FcoRV, the 
challenge is to attain a very high levei of specificity. Finally, 
for NMP kinases, the challenge is to transfer a phosphoryl 
group from ATP to a nucleotide and not to water. The ac- 
tions of these enzymes illustrate many important principies 
of catalysis. The mechanisms of these enzymes have been re- 
vealed through the use of incisive experimental probes, in- 
cluding the techniques of protein structure determination 
(Chapter 4) and site-directed mutagenesis (Chapter 6). These 
mechanisms include the use of binding energy and induced 
fit as well as several specific catalytic strategies. Properties 
common to an enzyme family reveal how their enzyme ac¬ 
tive sites have evolved and been refined. Structural and 
mechanistic comparisons of enzyme action are thus sources 
of insight into the evolutionary history of enzymes. These 
comparisons also reveal particularly effective Solutions to 
biochemical problems that are used repeatedly in biological 


9.1 Proteases: Facilitating a Difficult 
Reaction 

9.2 Carbonic Anhydrases: Making a Fast 
Reaction Faster 

9.3 Restriction Enzymes: Performing 
Highly Specific DNA Cleavage Reactions 

9.4 Nucleoside Monophosphate Kinases: 
Catalyzing Phosphoryl Group Exchange 
Without Promoting Hydrolysis 










CHAPTER 9 • Catalytic Strategies 


Systems. In addition, our knowledge of catalytic strategies has been used to 
develop practical applications, including drugs that are potent and specific 
enzyme inhibitors. Finally, although we shall not consider catalytic RN A mol- 
ecules (Section 28.x.x) explicitly in this chapter, the principies apply to these 
catalysts in addition to protein catalysts. 

9.0.1 A Few Basic Catalytic Principies Are Used 
by Many Enzymes 

In Chapter 8, we learned that enzymatic catalysis begins with substrate 
binding. The binding energy is the free energy released in the formation of 
a large number of weak interactions between the enzyme and the substrate. 
We can envision this binding energy as serving two purposes: it establishes 
substrate specificity and increases catalytic efficiency. Only the correct sub¬ 
strate can participate in most or all of the interactions with the enzyme and 
thus maximize binding energy, accounting for the exquisite substrate speci¬ 
ficity exhibited by many enzymes. Furthermore, the full complement of 
such interactions is formed only when the substrate is in the transition state. 
Thus, interactions between the enzyme and the substrate not only favor sub¬ 
strate binding but stabilize the transition state, thereby lowering the acti- 
vation energy. The binding energy can also promote structural changes in 
both the enzyme and the substrate that facilitate catalysis, a process referred 
to as induced fit. 

Enzymes commonly employ one or more of the following strategies to 
catalyze specific reactions: 

1. Covalent catalysis. In covalent catalysis, the active site contains a reac- 
tive group, usually a powerful nucleophile that becomes temporarily cova- 
lently modified in the course of catalysis. The proteolytic enzyme chy- 
motrypsin provides an excellent example of this mechanism (Section 9.1.2). 

2. General acid—base catalysis. In general acid-base catalysis, a molecule 
other than water plays the role of a proton donor or acceptor. Chymotrypsin 
uses a histidine residue as a base catalyst to enhance the nucleophilic power 
of serine (Section 9.1.3). 

3. Metal ion catalysis. Metal ions can function catalytically in several ways. 
For instance, a metal ion may serve as an electrophilic catalyst, stabilizing 
a negative charge on a reaction intermediate. Alternatively, the metal ion 
may generate a nucleophile by increasing the acidity of a nearby molecule, 
such as water in the hydration of CO 2 by carbonic anhydrase (Section 9.2.2). 
Finally, the metal ion may bind to substrate, increasing the number of in¬ 
teractions with the enzyme and thus the binding energy. This strategy is 
used by NMP kinases (Section 9.4.2). 

4. Catalysis by approximation. Many reactions include two distinct sub- 
strates. In such cases, the reaction rate may be considerably enhanced by 
bringing the two substrates together along a single binding surface on an 
enzyme. NMP kinases bring two nucleotides together to facilitate the trans- 
fer of a phosphoryl group from one nucleotide to the other (Section 9.4.3). 


9.1 PROTEASES; FACILITATING A DIFFICULT REACTION 

Protein turno ver is an important process in living systems (Chapter 23). 
Proteins that have served their purpose must be degraded so that their con- 
stituent amino acids can be recycled for the synthesis of new proteins. Pro¬ 
teins ingested in the diet must be broken down into small peptides and 


amino acids for absorption in the gut. Furthermore, as described in detail 
in Chapter 10, proteolytic reactions are important in regulating the activity 
of certain enzymes and other proteins. 

Proteases cleave proteins by a hydrolysis reaction —the addition of a mol- 
ecule of water to a peptide bond: 

O 

R ^ - + R2 -NH3+ 

Although the hydrolysis of peptide bonds is thermodynamically favored, 
such hydrolysis reactions are extremely slow. In the absence of a catalyst, 
the half-life for the hydrolysis of a typicai peptide at neutral pH is estimated 
to be between 10 and 1000 years. Yet, peptide bonds must be hydrolyzed 
within milliseconds in some biochemical processes. 

The Chemical bonding in peptide bonds is responsible for their kinetic 
stability. Specifically, the resonance structure that accounts for the planarity 
of a peptide bond (Section 3.2.2) also makes such bonds resistant to hy¬ 
drolysis. This resonance structure endows the peptide bond with partial 
double-bond character: 


9-3 I- 

Proteases 



The carbon-nitrogen bond is strengthened by its double-bond character, and 
the carbonyl carbon atom is less electrophilic and less susceptible to núcleo- 
philic attack than are the carbonyl carbon atoms in compounds such as car- 
boxylate esters. Consequently, to promote peptide-bond cleavage, an enzyme 
must facilitate nucleophilic attack at a normally unreactive carbonyl group. 

9.1.1 Chymotrypsin Possesses a Highly Reactive Serine Residue 

A number of proteolytic enzymes participate in the breakdown of proteins 
in the digestive systems of mammals and other organisms. One such en¬ 
zyme, chymotrypsin, cleaves peptide bonds selectively on the carboxyl- 
terminal side of the large hydrophobic amino acids such as tryptophan, ty- 
rosine, phenylalanine, and methionine (Figure 9.1). Chymotrypsin is a good 
example of the use of covalent modification as a catalytic strategy. The en¬ 
zyme employs a powerful nucleophile to attack the unreactive carbonyl 
group of the substrate. This nucleophile becomes covalently attached to the 
substrate briefly in the course of catalysis. 



FIGURE 9.1 Specifícity of chymotrypsin. 

Chymotrypsin cleaves proteins on the 
carboxyl side of aromatic or large 
hydrophobic amino acids (shaded yellow). 
The likely bonds cleaved by chymotrypsin 
are indicated in red. 
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FIGURE 9.2 An unusually reactíve 
seríne in chymotrypsín. Chymotrypsin is 
inactivated by treatment with diisopropy- 
Iphosphofluoridate (DIPF), which reacts 
only with serine 195 among 28 possible 
serine residues. 


Ser 195 


CH3 
CH3 


O 


-OH + 




XH3 
CH3 

DIPF 



+ F- + H 


FIGURE 9.3 Chromogeníc substrate. 

A/-Acetyl-L-phenylalanine p-nitrophenyl 
ester yields a yellow product, p-nitrophen- 
olate, on cleavage by chymotrypsin. 
p-Nitrophenolate forms by deprotonation 
of p-nitrophenol at pH 7. 


What is the nucleophile that chymotrypsin employs to attack the sub¬ 
strate carbonyl group? A clue carne from the fact that chymotrypsin con- 
tains an extraordinarily reactive serine residue. Treatment with organoflu- 
orophosphates such as diisopropylphosphofluoridate (DIPF); (Section 8.4.1) 
was found to inactivate the enzyme irreversibly (Figure 9.2). Despite the 
fact that the enzyme possesses 28 serine residues, only one, serine 195, was 
modified, resulting in a total loss of enzyme activity. This chemical modifi- 
cation reaction suggested that this unusually reactive serine residue plays a 
central role in the catalytic mechanism of chymotrypsin. 

9.1.2 Chymotrypsin Action Proceeds in Two Steps Linked 
by a Covalentiy Bound Intermediate 

How can we elucidate the role of serine 19 5 in chymotrypsin action? A study 
of the enzyme s kinetics provided a second clue to chymotrypsin s catalytic 
mechanism and the role of serine 195. The kinetics of enzyme action are of- 
ten easily monitored by having the enzyme act on a substrate analog that 
forms a colored product. For chymotrypsin, such a chromogenic substrate is 
iV-acetyl-L-phenylalanine p-nitrophenyl ester. This substrate is an ester 
rather than an amide, but many proteases will also hydrolyze esters. One of 
the Products formed by chymotrypsin’s cleavage of this substrate is p- 
nitrophenolate, which has a yellow color (Figure 9.3). Measurements of the 
absorbance of light revealed the amount ofp-nitrophenolate being produced. 

Under steady-State conditions, the cleavage of this substrate obeys 
Michaelis-Menten kinetics with a Km of 20 |ulM and a fecat of 77 s~b The 
initial phase of the reaction was examined by using the stopped-flow 
method. This technique permits the rapid mixing of enzyme and substrate 
and allows almost instantaneous monitoring of the reaction. At the begin- 
ning of the reaction, this method revealed a 'Turst'' phase during which the 











colored product was produced rapidly (Figure 9.4). Product was then pro- 
duced more slowly as the reaction reached the steady State. These results 
suggest that hydrolysis proceeds in two steps. The burst is observed be- 
cause, for this substrate, the first step is more rapid than the second step. 

The two steps are explained by the reaction of the serine nucleophile 
with the substrate to form the covalently bound enzyme-substrate inter- 
mediate (Figure 9.5). First, the highly reactive serine 195 hydroxyl group 
attacks the carbonyl group of the substrate to form the acyl-enzyme inter- 
mediate, releasing the alcohol p-nitrophenol (or an amine if the substrate is 
an amide rather than an ester). Second, the acyl-enzyme intermediate is hy- 
drolyzed to release the carboxylic acid component of the substrate and re- 
generate the free enzyme. Thus, p-nitrophenolate is produced rapidly on 
the addition of the substrate as the acyl-enzyme intermediate is formed, but 
it takes longer for the enzyme to be 'heset'' by the hydrolysis of the acyl- 
enzyme intermediate. 



FIGURE 9.4 Kinetícs of chymotrypsín 
catalysís. Two stages are evident in the 
cleaving of A/-acetyl-L-phenylalanine 
p-nitrophenyl ester by chymotrypsín: a 
rapid burst phase (pre-steady State) and 
a steady-state phase. 


(A) 


(B) 


OH + X 


Acylation 


XH = ROH (ester), RNH 2 (amide) 




Deacylation 

- 7 ^-^ 

H2O 


OH + HO 


Enzyme 


Acyl-enzyme 


Enzyme 


FIGURE 9.5 Covalent catalysís. Hydrolysis by chymotrypsín takes place in two stages: 
(A) acylation to form the acyl-enzyme intermediate followed by (B) deacylation to 
regenerate the free enzyme. 


9.1.3 Serine is Part of a Catalytic Triad That Aiso Includes 
Histidine and Aspartic Acid 

The determination of the three-dimensional structure of chymotrypsín by 
David Blow in 1967 was a source of further insight into its mechanism of ac- 
tion. Overall, chymotrypsín is roughly spherical and comprises three polypep- 
tide chains, linked by disulfide bonds. It is synthesized as a single polypep- 
tide, termed chymotrypsinogen, which is activated by the 
proteolytic cleavage of the polypeptide to yield the three 
chains. The active site of chymotrypsín, marked by serine 
195, lies in a cleft on the surface of the enzyme (Figure 9.6). 

The structural analysis revealed the chemical basis of the 
special reactivity of serine 195 (Figure 9.7). The side chain 
of serine 195 is hydrogen bonded to the imidazole ring of 
histidine 57. The -NH group of this imidazole ring is, in 
turn, hydrogen bonded to the carboxylate group of aspar- 
tate 102. This constellation of residues is referred to as the 
catalytic triad. How does this arrangement of residues lead 
to the high reactivity of serine 195? The histidine residue 
serves to position the serine side chain and to polarize its 
hydroxyl group. In doing so, the residue acts as a general 


% 


CONCEPTUAL INSIGHTS, 
Enzyme Kinetícs. See the section 
entitied "Pre-Steady-State Kinetícs" in 
Conceptual Insights module to better 
understand why a "burst" phase at 
short reaction times implies the 
existence of an enzyme-substrate 
intermediate. 


Serine 195 


FIGURE 9.6 Three-dimensional structure of 
chymotrypsín. The three chains are shown in ribbon form in 
orange, blue, and green. The side chains of the catalytic triad residues, 
including serine 195, are shown as ball-and-stick representations, as 
are two intrastrand and interstrand disulfide bonds. 













FIGURE 9.7 The catalytíc tríad. The catalytic triad, shown on the left, converts serine 
195 into a potent nucleophile, as illustrated on the right. 


FIGURE 9.8 Peptide hydrolysís by chy- 
motrypsín. The mechanism of peptide 
hydrolysis illustrates the principies of cova- 
lent and acid-base catalysis. The dashed 
green lines indicate favorable interactions 
between the negatively charged aspartate 
residue and the positively charged histi- 
dine residue, which make the histidine 
residue a more powerful base. 


base catalyst, a hydrogen ion acceptor, because the polarized hydroxyl group 
of the serine residue is poised for deprotonation. The withdrawal of the pro- 
ton from the hydroxyl group generates an alkoxide ion, which is a much more 
powerful nucleophile than an alcohol is. The aspartate residue helps orient the 
histidine residue and make it a better proton acceptor through electrostatic 
effects. 

These observations suggest a mechanism for peptide hydrolysis (Figure 
9.8). After substrate binding (step 1), the reaction begins with the hydroxyl 
group of serine 195 making a nucleophilic attack on the carbonyl carbon 
atom of the substrate (step 2). The nucleophilic attack changes the geome- 
try around this carbon atom from trigonal planar to tetrahedral. The inher- 
ently unstable tetrahedral intermediate formed bears a formal negative 
charge on the oxygen atom derived from the carbonyl group. This charge 
is stabilized by interactions with NH groups from the protein in a site 


Oxyanion 

hole 
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Oxyanion hole 


Ser 195 


FIGURE 9.9 The oxyanion hole. The 

structure stabilizes the tetrahedral interme- 
diate of the chymotrypsin reaction. Hydro- 
gen bonds (shown in green) link peptide 
NH groups and the negatively charged 
oxygen. 



Proteases 


FIGURE 9.10 The hydrophobic pocket 
of chymotrypsin. The hydrophobic 
pocket of chymotrypsin is responsible for 
its substrate specificity. The key amino 
acids that constitute the binding site are 
labeled, including the active-site serine 
residue (boxed). The position of an 
aromatic ring bound in the pocket is 
shown in green. 


termed the oxyanion hole (Figure 9.9). These interactions also help stabilize 
the transition state that precedes the formation of the tetrahedral interme- 
diate. This tetrahedral intermediate then collapses to generate the acyl-en- 
zyme (step 3). This step is facilitated by the transfer of a proton from the 
positively charged histidine residue to the amino group formed by cleavage 
of the peptide bond. The amine component is now free to depart from the 
enzyme (step 4) and is replaced by a water molecule (step 5). The ester group 
of the acyl-enzyme is now hydrolyzed by a process that is essentially a re- 
peat of steps 2 through 4. The water molecule attacks the carbonyl group 
while a proton is concomitantly removed by the histidine residue, which 
now acts as a general acid catalyst, forming a tetrahedral intermediate (step 
6). This structure breaks down to form the carboxylic acid product (step 7). 
Finally, the release of the carboxylic acid product (step 8) readies the en¬ 
zyme for another round of catalysis. 

This mechanism accounts for all characteristics of chymotrypsin action 
except the observed preference for cleaving the peptide bonds just past 
residues with large, hydrophobic side chains. Examination of the three- 
dimensional structure of chymotrypsin with substrate analogs and enzyme 
inhibitors revealed the presence of a deep, relatively hydrophobic pocket, 
called the Si pocket, into which the long, uncharged side chains of residues 
such as phenylalanine and tryptophan can fit. The binding of an appropri- 
ate side chain into this pocket positions the adjacent peptide bond into the ac¬ 
tive site for cleavage (Figure 9.10). The specificity of chymotrypsin depends 
almost entirely on which amino acid is directly on the amino-terminal side 
of the peptide bond to be cleaved. Other proteases have more-complex 
specificity patterns, as illustrated in Figure 9.11. Such enzymes have addi- 
tional pockets on their surfaces for the recognition of other residues in the 
substrate. Residues on the amino-terminal side of the scissile bond (the bond 



FIGURE 9.11 Specificity nomenclature 
for protease-substrate interactions. 

The potential sites of interaction of the 
substrate with the enzyme are designated 
P (shown in red), and corresponding 
binding sites on the enzyme are 
designated S. The scissile bond (also 
shown in red) is the reference point. 
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FIGURE 9.12 Structural 


símílarity of trypsín and chymotrypsín. 

An overlay of the structure of chymotrypsín 
(red) on that of trypsín (blue) shows the 
hígh degree of símílarity. Only a-carbon 
atom posítíons are shown. The mean 
devíatíon ín posítíon between correspon- 
ding a-carbon atoms ís 1.7 Â. 


to be cleaved) are labeled P^, P 2 , P 3 , and so forth, indicating their positions 
in relation to the scissile bond. Likewise, residues on the carboxyl side of 
the scissile bond are labeled P/, P 2 ', P 3 ', and so forth. The corresponding 
sites on the enzyme are referred to as Si, S 2 or Si', S 2 ^ and so forth. 




STRUCTURAL INSIGHTS, Chymotrypsín: A Seríne Protease. Work with Interactive 
molecular models to learn more about the structural bases of active site specificity and 
reactivity, and some of the ways in which active site residues can be identified. 


9.1.4 Catalytic Triads Are Found in Other Hydrolytic Enzymes 

Y Many other proteins have subsequently been found to contain cat¬ 
alytic triads similar to that discovered in chymotrypsin. Some, such 
as trypsin and elastase, are obvious homologs of chymotrypsin. The se- 
quences of these proteins are approximately 40% identical with that of chy¬ 
motrypsin, and their overall structures are nearly the same (Figure 9.12). 
These proteins operate by mechanisms identical with that of chymotrypsin. 
However, they have very different substrate specificities. Trypsin cleaves at 
the peptide bond after residues with long, positively charged side chains— 
namely, arginine and lysine—whereas elastase cleaves at the peptide bond 
after amino acids with small side chains—such as alanine and serine. Com- 
parison of the Si pockets of these enzymes reveals the basis of the speci¬ 
ficity. In trypsin, an aspartate residue (Asp 189) is present at the bottom of 
the Si pocket in place of a serine residue in chymotrypsin. The aspartate 
residue attracts and stabilizes a positively charged arginine or lysine residue 
in the substrate. In elastase, two residues at the top of the pocket in chy¬ 
motrypsin and trypsin are replaced with valine (Vai 190 and Vai 216). These 
residues dose off the mouth of the pocket so that only small side chains may 
enter (Figure 9.13). 

Other members of the chymotrypsin family include a collection of pro¬ 
teins that take part in blood clotting, to be discussed in Chapter 10 . In ad- 
dition, a wide range of proteases found in bactéria and viruses also belong 
to this clan. Furthermore, other enzymes that are not homologs of chy¬ 
motrypsin have been found to contain very similar active sites. As noted in 
Chapter 7, the presence of very similar active sites in these different pro- 
tein families is a consequence of convergent evolution. Subtilisin, a protease 
in bactéria such as Bacillus amyloliquefadens, is a particularly well charac- 
terized example. The active site of this enzyme includes both the catalytic 
triad and the oxyanion hole. However, one of the NH groups that forms the 
oxyanion hole comes from the side chain of an asparagine residue rather 
than from the peptide backbone (Figure 9.14). Subtilisin is the founding 
member of another large family of proteases that includes representatives 
from Archaea, Eubacteria, and Eukarya. 

Yet another example of the catalytic triad has been found in carboxy- 
peptidase II from wheat. The structure of this enzyme is not significantly 
similar to either chymotrypsin or subtilisin (Figure 9.15). This protein is a 
member of an intriguing family of homologous proteins that includes es- 
terases such as acetylcholine esterase and certain lipases. These enzymes all 
make use of histidine-activated nucleophiles, but the nucleophiles may be 
cysteine rather than serine. Finally, other proteases have been discovered that 
contain an active-site serine or threonine residue that is activated not by a 
histidine-aspartate pair but by a primary amino group from the side chain 
of lysine or by the N-terminal amino group of the polypeptide chain. 

Thus, the catalytic triad in proteases has emerged at least three times in 
the course of evolution. We can conclude that this catalytic strategy must 
be an especially effective approach to the hydrolysis of peptides and related 
bonds. 
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FIGURE 9.13 The Si pockets of chymotrypsín, trypsin, and elastase. Certain residues 
play key roles in determining the specificity of these enzymes. The side chains of these 
residues, as well as those of the active-site serine residues, are shown in color. 




Asp 32 


Oxyanion hole 


Asn 155 


Ser 221 


His 64 


FIGURE 9.14 The catalytic triad and oxyanion hole of subtilisin. The peptide bond 
attacked by nucleophilic serine 221 of the catalytic triad wili develop a negative charge, 
which is stabilized by enzyme NH groups (both in the backbone and in the side chain of 
Asn 155) located in the oxyanion hole. 


Tyr 147 




FIGURE 9.15 Carboxypeptidase II. The structure of carboxypeptidase II from 
wheat (right) is illustrated with its two chains (blue and red). The catalytic triad of 
carboxypeptidase II (left) is composed of the same amino acids as those in chymotrypsin, 
despite the fact that the enzymes display no structural similarity. The residues that form the 
oxyanion hole are highlighted in yellow. 








FIGURE 9.16 Síte-directed muta- 
genesis of subtílísín. Residues of the 
catalytic triad were mutated to alanine, 
and the activity of the mutated enzyme 
was measured. Mutations in any 
component of the catalytic triad cause a 
dramatic loss of enzyme activity. Note that 
the activity is displayed on a logarithmic 
scale. The mutations are identified as 
follows: the first letter is the one-letter 
abbreviation for the amino add being 
altered; the number identifies the position 
of the residue in the primary structure; 
and the second letter is the one-letter 
abbreviation for the amino add replacing 
the original one. 


9.1.5 The Catalytic Triad Has Been Dissected 
by Site-Directed Mutagenesis 

The techniques of molecular biology discussed in Chapter 
6 have permitted detailed examination of the catalytic triad. 
In particular, site-directed mutagenesis has been used to 
test the contribution of individual amino acid residues to 
the catalytic power of an enzyme. Subtilisin has been ex- 
tensively studied by this method. Each of the residues 
within the catalytic triad, consisting of aspartic acid 32, his- 
tidine 64, and serine 221, has been individually converted 
into alanine, and the ability of each mutant enzyme to cleave 
a model substrate has been examined (Figure 9.16). As ex- 
pected, the conversion of active-site serine 221 into alanine 
dramatically reduced catalytic power; the value of kcat 
to less than one-millionth of its value for the wild-type en¬ 
zyme. The value of ÍCm was essentially unchanged: its increase by no more 
than a factor of two indicated that substrate binding is not significantly af- 
fected. The mutation of histidine 64 to alanine had very similar effects. These 
observations support the notion that the serine-histidine pair act together to 
generate a nucleophile of sufficient power to attack the carbonyl group of a 
peptide bond. The conversion of aspartate 32 into alanine had a smaller effect, 
although the value of fe^at still fell to less than 0.005% of its wild-type value. 
The simultaneously conversion of all three catalytic triad residues into alanine 
was no more deleterious than the conversion of serine or histidine alone. De- 
spite the reduction in their catalytic power, the mutated enzymes still hy- 
drolyze peptides a thousand times as rapidly as does buffer at pH 8.6. 

Because the oxyanion hole of subtilisin includes a side-chain NH group 
in addition to backbone NH groups, it is possible to probe the importance 
of the oxyanion hole for catalysis by site-directed mutagenesis. The muta¬ 
tion of asparagine 155 to glycine reduced the value of fe^at to 0.2% of its wild- 
type value but increased the value of by only a factor of two. These ob¬ 
servations demonstrate that the NH group of the asparagine residue plays 
a significant role in stabilizing the tetrahedral intermediate and the transi- 
tion State leading to it. 


9.1.6 Cysteine, Aspartyl, and Metalloproteases Are Other Major 
Classes of Peptide-Cleaving Enzymes 

r Not all proteases utilize strategies based on activated serine residues. 

Classes of proteins have been disco vered that employ three alter - 
native approaches to peptide-bond hydrolysis (Figure 9.17). These classes 
are the (1) cysteine proteases, (2) aspartyl proteases, and (3) metallopro¬ 
teases. In each case, the strategy generates a nucleophile that attacks the 
peptide carbonyl group (Figure 9.18). 

The strategy used by the cysteine proteases is most similar to that used by 
the chymotrypsin family. In these enzymes, a cysteine residue, activated by 
a histidine residue, plays the role of the nucleophile that attacks the peptide 
bond (see Figure 9.18), in a manner quite analogous to that of the serine 
residue in serine proteases. An ideal example of these proteins is papain, an 
enzyme purified from the fruit of the papaya. Mammalian proteases homol- 
ogous to papain have been discovered, most notably the cathepsins, proteins 
having a role in the immune and other systems. The cysteine-based active 
site arose independently at least twice in the course of evolution; the caspases, 
enzymes that play a major role in apoptosis (Section 2.4.3), have active sites 
similar to that of papain, but their overall structures are unrelated. 
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FIGURE 9.17 Three classes of 
proteases and theír active sites. These 
examples of a cysteine protease, an as- 
partyl protease, and a metalloprotease use 
a histidine-activated cysteine residue, an 
aspartate-activated water molecule, and a 
metal-activated water molecule, respec- 
tively, as the nucleophile. The two halves 
of renin are in blue and red to highlight 
the approximate twofold symmetry of 
aspartyl proteases. 


(A) CYSTEINE PROTEASES (B) ASPARTYL PROTEASES 



(C) METALLOPROTEASES 



FIGURE 9.18 The activation strategies for three classes of proteases. The peptide 
carbonyl group is attacked by (A) a histidine-activated cysteine, in the cysteine proteases; 
(B) an aspartate-activated water molecule, in the aspartyl proteases; and (C) a metal- 
activated water molecule, in the metalloproteases. For the metalloproteases, the letter B 
represents a base (often a glutamate) that helps deprotonate the metal-bound water. 
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water molecule to attack the peptide bond. One aspartic acid residue (in its 
deprotonated form) activates the attacking water molecule by poising it for 
deprotonation, whereas the other aspartic acid residue (in its protonated 
form) polarizes the peptide carbonyl, increasing its susceptibility to attack 
(see Figure 9.18). Members of this class include renin, an enzyme having a 
role in the regulation of blood pressure, and the digestive enzyme pepsin. 
These proteins possess approximate twofold symmetry, suggesting that the 
two halves are evolutionarily related. A likely scenario is that two copies of 
a gene for the ancestral enzyme fused to form a single gene that encoded a 
single-chain enzyme. Each copy of the gene would have contributed an as- 
partate residue to the active site. The human immunodeficiency virus (HIV) 
and other retroviruses contain an unfused dimeric aspartyl protease that is 
similar to the fused protein, but the individual chains are not joined to make 
a single chain (Figure 9-19). This observation is consistent with the idea 
that the enzyme may have originally existed as separate subunits. 

The metalloproteases constitute the final major class of peptide-cleaving 
enzymes. The active site of such a protein contains a bound metal ion, al- 
most always zinc, that activates a water molecule to act as a nucleophile to 
attack the peptide carbonyl group. The bacterial enzyme thermolysin and 
the digestive enzyme carboxypeptidase A are classic examples of the zinc 
proteases. Thermolysin, but not carboxypeptidase A, is a member of a large 
and diverse family of homologous zinc proteases that includes the matrix 
metalloproteases, enzymes that catalyze the reactions in tissue remodeling 
and degradation. 

In each of these three classes of enzymes, the active site includes features 
that allow for the activation of water or another nucleophile as well as for the 
polarization of the peptide carbonyl group and subsequent stabilization of a 
tetrahedral intermediate (see Figure 9.18). 


FIGURE 9.19 The structure 
of HIV protease and its bíndíng 
pocket. The protease is a dimer of identi- 
cal subunits, shown in blue and yellow, 
consisting of 99 amino adds each. The 
active-site aspartic acid residues, one from 
each chain, are shown as ball-and-stick 
structures. The flaps wili dose down on 
the binding pocket after substrate has 
been bound. 



9.1.7 Protease Inhibitors Are Important Drugs 

Compounds that block or modulate the activities of proteases can 
have dramatic biological effects. Most natural protease inhibitors are 
similar in structure to the peptide substrates of the enzyme that each inhibits 
(Section 10.7.4). Several important drugs are protease inhibitors. For example, 
captopril, an inhibitor of the metalloprotease angiotensin-converting enzyme 
(ACE), has been used to regulate blood pressure. Crixivan, an inhibitor of 
the HIV protease, is used in the treatment of AIDS. This protease cleaves 
multidomain virai proteins into their active forms; blocking this process com- 
pletely prevents the virus from being infectious (see 
Figure 9.19). To prevent unwanted side effects, protease 
inhibitors used as drugs must be specific for one enzyme 
without inhibiting other proteins within the body. 

Let us examine the interaction of Crixivan with HIV 
protease in more detail. Crixivan is constructed around 
an alcohol that mimics the tetrahedral intermediate; 
other groups are present to bind into the S 2 , Si, Si', and 
S 2 ' recognition sites on the enzyme (Figure 9.20). The 
results of x-ray crystallographic studies revealed the 
structure of the enzyme—Crixivan complex, showing that 
Crixivan adopts a conformation that approximates the 
twofold symmetry of the enzyme (Figure 9.21). The ac¬ 
tive site of HIV protease is covered by two apparently 
flexible flaps that fold down on top of the bound in- 
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FIGURE 9.20 Crixivan, an HIV 
protease inhibitor. The structure of 
Crixivan is shown in comparison with that 
of a peptide substrate of HIV protease. 
The scissile bond in the substrate is high- 
lighted in red. 


hibitor. The hydroxyl group of the central alcohol interacts with two as- 
partate residues of the active site, one in each subunit. In addition, two car- 
bonyl groups of the inhibitor are hydrogen bonded to a water molecule (not 
shown), which, in turn, is hydrogen bonded to a peptide NH group in each 
of the flaps. This interaction of the inhibitor with water and the enzyme is 
not possible with cellular aspartyl proteases such as renin and thus may con- 
tribute to the specificity of Crixivan and other inhibitors for HIV protease. 



FIGURE 9.21 HIV protease-Crixivan complex. (Left) The HIV protease is shown 
with the inhibitor crixivan bound at the active site. (Right) The drug has been rotated to 
reveal its approximately twofold symmetric conformation. 


9.2 MAKING A FAST REACTION FASTER; 

CARBONIC ANHYDRASES 

Carbon dioxide is a major end product of aerobic metabolism. In complex 
organisms, this carbon dioxide is released into the blood and transported 
to the lungs for exhalation. While in the blood, carbon dioxide reacts with 
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water. The product of this reaction is a moderately strong acid, carbonic 
acid (pKa = 3.5), which becomes bicarbonate ion on the loss of a proton. 


O 

II 

C + H 2 O 

II 
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o 



Carbonic 

acid 


O 



+ H+ 


Bicarbonate 

ion 


Even in the absence of a catalyst, this hydration reaction proceeds at a mod- 
erate pace. At 37°C near neutral pH, the second-order rate constant ki is 
0.0027 s~b This corresponds to an effective first-order rate constant 
of 0.15 s~^ in water ([H 2 O] = 55.5 M). Similarly, the reverse reaction, the 
dehydration of bicarbonate, is relatively rapid, with a rate constant of 
k-i = 50 s~\ These rate constants correspond to an equilibrium constant 
of Kl = 5.4 X 10 ~^ and a ratio of [CO 2 ] to [H 2 CO 3 ] of 340:1. 

T Despite the fact that CO 2 hydration and HC 03 ~dehydration occur 
spontaneously at reasonable rates in the absence of catalysts, almost 
all organisms contain enzymes, referred to as carbonic anhydrases, that cat- 
alyze these processes. Such enzymes are required because CO 2 hydration and 
HC 03 ~ dehydration are often coupled to rapid processes, particularly trans- 
port processes. For example, HC 03 ~ in the blood must be dehydrated to 
form CO 2 for exhalation as the blood passes through the lungs. Conversely, 
CO 2 must be converted into HC 03 ~ for the generation of the aqueous hu¬ 
mor of the eye and other secretions. Furthermore, both CO 2 and HC 03 ~ 
are substrates and products for a variety of enzymes, and the rapid inter- 
conversion of these species may be necessary to ensure appropriate substrate 
leveis. So important are these enzymes in human beings that mutations in 
some carbonic anhydrases have been found to cause osteopetrosis (excessive 
formation of dense bones accompanied by anemia) and mental retardation. 

Carbonic anhydrases accelerate CO 2 hydration dramatically. The most 
active enzymes, typified by human carbonic anhydrase II, hydrate CO 2 at 
rates as high as fecat = 10 ^s~\ora million times a second. Fundamental 
physical processes such as diffusion and proton transfer ordinarily limit the 
rate of hydration, and so special strategies are required to attain such prodi- 
gious rates. 


9.2.1 Carbonic Anhydrase Contains a Bound Zinc Ion Essential 
for Catalytic Activity 

Less than 10 years after the discovery of carbonic anhydrase in 1932, this 
enzyme was found to contain bound zinc, associated with catalytic activity. 
This discovery, remarkable at the time, made carbonic anhydrase the first 
known zinc-containing enzyme. At present, hundreds of enzymes are 
known to contain zinc. In fact, more than one-third of all enzymes either 
contain bound metal ions or require the addition of such ions for activity. 
The Chemical reactivity of metal ions—associated with their positive 
charges, with their ability to form relatively strong yet kinetically labile 
bonds, and, in some cases, with their capacity to be stable in more than one 
oxidation state—explains why catalytic strategies that employ metal ions 
have been adopted throughout evolution. 

The results of x-ray crystallographic studies have supplied the most de- 
tailed and direct information about the zinc site in carbonic anhydrase. At 
least seven carbonic anhydrases, each with its own gene, are present in hu¬ 
man beings. They are all clearly homologous, as revealed by substantial 
leveis of sequence identity. Carbonic anhydrase II, present in relatively 
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FIGURE 9.22 The structure of human carboníc anhydrase II and its zínc site. 

(Left) The zinc is bound to the imidazole rings of three histidine residues as well as to a 
water molecule. (Right) The location of the zinc site in the enzyme. 


Carbonic Anhydrases 


high concentrations in red blood cells, has been the most extensively stud- 
ied (Figure 9.22). 

Zinc is found only in the + 2 State in biological systems; so we need con- 
sider only this oxidation levei as we examine the mechanism of carbonic an¬ 
hydrase. A zinc atom is essentially always bound to four or more ligands; 
in carbonic anhydrase, three coordination sites are occupied by the imida¬ 
zole rings of three histidine residues and an additional coordination site is 
occupied by a water molecule (or hydroxide ion, depending on pH). Because 
all of the molecules occupying the coordination sites are neutral, the over- 
all charge on the Zn(His )3 unit remains +2. 


9.2.2 Catalysis Entails Zinc Activation of Water 

How does this zinc complex facilitate carbon dioxide hydration? A major 
clue comes from the pH profile of enzymatically catalyzed carbon dioxide 
hydration (Figure 9.23). At pH 8, the reaction proceeds near its maximal 
rate. As the pH decreases, the rate of the reaction drops. The midpoint of 
this transition is near pH 7, suggesting that a group with pK^ = 7 plays an 
important role in the activity of carbonic anhydrase and that the deproto- 
nated (high pH) form of this group participates more effectively in cataly¬ 
sis. Although some amino acids, notably histidine, have pK^ values near 7, 
a variety of evidence suggests that the group responsible for this transition is 
not an amino acid but is the zinc-bound water molecule. Thus, the binding of 
a water molecule to the positively charged zinc center reduces the pK^ of 
the water molecule from 15.7 to 7 (Figure 9.24). With the lowered pKa, a 



FIGURE 9.23 Effect of pH on carboníc 
anhydrase activity. Changes in pH 
alter the rate of carbon dioxide hydration 
catalyzed by carbonic anhydrase 11. The 
enzyme is maximally active at high pH. 
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FIGURE 9.24 The p/Cg of water-bound zinc. Binding to zinc lowers the pK^ of water 
from 15.7 to 7. 









H^o^H 


Zn 


His" 


^ His 
His 


HC 03 “<^ 

H20 








His" 


/ 

-V 

Zn2+; . 

s. His 
His 



H^O- 


Zn 


His" 


His 

His 



His 



substantial concentration of hydroxide ion (bound to zinc) is generated at 
neutral pH. A zinc-bound hydroxide ion is sufficiently nucleophilic to at- 
tack carbon dioxide much more readily than water does. The importance of 
the zinc-bound hydroxide ion suggests a simple mechanism for carbon diox¬ 
ide hydration (Figure 9.25). 

1. Zinc facilitates the release of a proton from a water molecule, which gen- 
erates a hydroxide ion. 

2. The carbon dioxide substrate binds to the enzyme’s active site and is po- 
sitioned to react with the hydroxide ion. 

3. The hydroxide ion attacks the carbon dioxide, converting it into bicar- 
bonate ion. 


FIGURE 9.25 Mechanism of carboníc 
anhydrase. The zinc-bound hydroxide 
mechanism for the hydration of carbon 
dioxide catalyzed by carbonic anhydrase. 


4. The catalytic site is regenerated with the release of the bicarbonate ion 
and the binding of another molecule of water. 

Thus, the binding of water to zinc favors the formation of the transition 
State, leading to bicarbonate formation by facilitating proton release and by 
bringing the two reactants into dose proximity. A range of studies supports 
this mechanism. In particular, studies of a synthetic analog model system pro- 
vide evidence for its plausibility. A simple synthetic ligand binds zinc 
through four nitrogen atoms (compared with three histidine nitrogen atoms 
in the enzyme), as shown in Figure 9.26. One water molecule remains bound 
to the zinc ion in the complex. Direct measurements reveal that this water 
molecule has a píCa value of 8.7, not as low as the value for the water mol¬ 
ecule in carbonic anhydrase but substantially lower than the value for free 
water. At pH 9.2, this complex accelerates the hydration of carbon dioxide 
more than 100-fold. Although catalysis by this synthetic system is much 
less efficient than catalysis by carbonic anhydrase, the model system 
strongly suggests that the zinc-bound hydroxide mechanism is likely to be 
correct. Carbonic anhydrases have evolved to utilize the reactivity intrinsic 
to a zinc-bound hydroxide ion as a potent catalyst. 


FIGURE 9.26 A synthetic analog 
model system for carbonic anhydrase. 

(A) An organic compound, capable of 
binding zinc, was synthesized as a model 
for carbonic anhydrase. The zinc complex 
of this ligand accelerates the hydration of 
carbon dioxide more than 100-fold under 
appropriate conditions. (B) The structure 
of the presumed active complex showing 
zinc bound to the ligand and to one water 
molecule. 


(A) 




9.2.3 A Proton Shuttie Facilitates Rapid Regeneration of the 
Active Form of the Enzyme 

As noted earlier, some carbonic anhydrases can hydrate carbon dioxide at 
rates as high as a million times a second (10^ s~^). The magnitude of this rate 
can be understood from the following observations. At the conclusion of a car¬ 
bon dioxide hydration reaction, the zinc-bound water molecule must lose a 
proton to regenerate the active form of the enzyme (Figure 9.27). The rate of 
the rever se reaction, the protonation of the zinc-bound hydroxide ion, is lim- 
ited by the rate of proton diffusion. Protons diffuse very rapidly with second- 
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FIGURE 9.27 Kínetícs of water deprotonation. The kinetics of deprotonation and pro- 
tonation of the zinc-bound water molecule in carbonic anhydrase. 
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FIGURE 9.28 The effect of buffer on deprotonation. The deprotonation of the zinc- 
bound water molecule in carbonic anhydrase is aided by buffer component B. 
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order rate constants near 

fe_i must be less than 10^^ s~b Because the equilibrium constant K is 
equal to ki/k-i, the forward rate constant is given hy ki = K • fe_i. Thus, 
if k-i < 10^^ s~^ and K = 10~^ M (because pK^ = 7), then ki must 

be less than or equal to s~^. In other words, the rate of proton diffusion 
limits the rate of proton release to less than s~^ for a group with pfCa= 7. 
However, if carbon dioxide is hydrated at a rate of 10^ s~^, then every step 
in the mechanism (see Figure 9.25) must take place at least this fast. How 
can this apparent paradox be resolved? 

The answer became clear with the realization that the highest rates of car¬ 
bon dioxide hydration require the presence of buffer, suggesting that the buffer 
components participate in the reaction. The buffer can bind or release protons. 
The advantage is that, whereas the concentrations of protons and hydroxide 
ions are limited to M at neutral pH, the concentration of buffer com¬ 
ponents can be much higher, on the order of several millimolar. If the buffer 
component BH^ has a pK^ of 7 (matching that for the zinc-bound water), 
then the equilibrium constant for the reaction in Figure 9.28 is 1. The rate 
of proton abstraction is given hy kf • [B]. The second-order rate constants 
kf and fe_/will be limited by buffer diffusion to values less than approxi- 


-3 


10^ s“\ 


mately 10^ M s . Thus, buffer concentrations greater than [B] = 10 
M (1 mM) may be high enough to support carbon dioxide hydration rates 
of 10^ s“^ because kf * [B] = (10^ s“^) • (10“^ M) 

This prediction is confirmed experimentally (Figure 9.29). 

The molecular components of many buffers are too 
large to reach the active site of carbonic anhydrase. Car¬ 
bonic anhydrase II has evolved a proton shuttle to allow 
buffer components to participate in the reaction from so- 
lution. The primary component of this shuttle is histidine 
64. This residue transfers protons from the zinc-bound C 
water molecule to the protein surface and then to the ^ 
buffer (Figure 9.30). Thus, catalytic function has been en- 
hanced through the evolution of an apparatusfor controlling 
proton transfer from and to the active site. Because protons 
participate in many biochemical reactions, the manipula- 
tion of the proton inventory within active sites is crucial 
to the function of many enzymes and explains the promi- 
nence of acid-base catalysis. 


FIGURE 9.29 The effect of buffer 
concentration on the rate of carbon 
dioxide hydration. The rate of carbon 
dioxide hydration increases with the con¬ 
centration of the buffer 1,2-dimethylbenz- 
imidazole. The buffer enables the enzyme 
to achieve its high catalytic rates. 
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FIGURE 9.30 Histidíne proton shuttle. 

(1) Histidine 64 abstracts a proton from 
the zinc bound water molecule, generating 
a nucleophilic hydroxide ion and a proto- 
nated histidine. (2) The buffer (B) removes 
a proton from the histidine, regenerating 
the unprotonated form. 


9.2.4 Convergent Evolution Has Generated Zinc-Based Active 
Sites in Different Carbonic Anhydrases 

Y Carbonic anhydrases homologous to the human enzymes, referred 
to as a-carbonic anhydrases, are common in animais and in some 
bactéria and algae. In addition, two other families of carbonic anhydrases 
have been discovered. The ^-carbonic anhydrases are found in higher 
plants and in many bacterial species, including E. coli. These proteins con- 
tain the zinc required for catalytic activity but are not significantly simi¬ 
lar in sequence to the o:-carbonic anhydrases. Furthermore, the |3-carbonic 
anhydrases have only one conserved histidine residue, whereas the a- 
carbonic anhydrases have three. No three-dimensional structure is yet 
available, but spectroscopic studies suggest that the zinc is bound by one 
histidine residue, two cysteine residues (conserved among |3-carbonic an¬ 
hydrases), and a water molecule. A third family, the y-carbonic anhydrases, 
also has been identified, initially in the archaeon Methanosarcina ther- 
mophila. The crystal structure of this enzyme reveals three zinc sites ex- 
tremely similar to those in the a-carbonic anhydrases. In this case, how- 
ever, the three zinc sites lie at the interfaces between the three subunits of 
a trimeric enzyme (Figure 9.31). The very striking left-handed (3-helix (a 
P strand twisted into a left-handed helix) structure present in this enzyme 
has also been found in enzymes that catalyze reactions unrelated to those 
of carbonic anhydrase. Thus, convergent evolution has generated carbonic 
anhydrases that rely on coordinated zinc ions at least three times. In each 



FIGURE 9.31 y-Carbonic anhydrase. (Left) The zinc site of 7 -carbonic anhydrase. 
(Middie) The trimeric structure of the protein (individual chains are labeled A, B, and C). 
(Right) The protein is rotated to show a top-down view that highlights its threefold symme- 
try and the position of the zinc sites (green) at the interfaces between subunits. 










case, the catalytic activity appears to be associated with zinc-bound wa- 
ter molecules. 


9.3 RESTRICTION ENZYIVIES; PERFORMING HIGHLY 
SPECIFIC DNA-CLEAVAGE REACTIONS 

Let us next consider a hydrolytic reaction that results in the cleavage of 
DNA. Bactéria and archaea have evolved mechanisms to protect themselves 
from virai infections. Many viruses inject their DNA genomes into cells; 
once inside, the virai DNA hijacks the celbs machinery to drive the pro- 
duction of virai proteins and, eventually, of progeny virus. Often, a virai in- 
fection results in the death of the host. A major protective strategy for the 
host is to use restriction endonucleases (restriction enzymes) to degrade the 
virai DNA on its introduction into a cell. These enzymes recognize partic¬ 
ular base sequences, called recognition sequences or recognition sites^ in their 
target DNA and cleave that DNA at defined positions. The most well stud- 
ied class of restriction enzymes comprises the so-called type II restriction 
enzymes, which cleave DNA within their recognition sequences. Other 
types of restriction enzymes cleave DNA at positions somewhat distant 
from their recognition sites. 

Restriction endonucleases must show tremendous specificity at two lev¬ 
eis. First, they must cleave only DNA molecules that contain recognition sites 
(hereafter referred to as cognate DNA) without cleaving DNA molecules that 
lack these sites. Suppose that a recognition sequence is six base pairs long. 
Because there are 4^, or 4096, sequences having six base pairs, the concen- 
tration of sites that must not be cleaved will be approximately 5000-fold as 
high as the concentration of sites that should be cleaved. Thus, to keep from 
damaging host-cell DNA, endonucleases must cleave cognate DNA mole¬ 
cules much more than 5000 times as efficiently as they cleave nonspecific 
sites. Second, restriction enzymes must not degrade the host DNA. How do 
these enzymes manage to degrade virai DNA while sparing their own? 

The restriction endonuclease FcoRV (from E. coli) cleaves double- 
stranded virai DNA molecules that contain the sequence 5'-GATATC-3' 
but leaves intact host DNA containing hundreds of such sequences. The 
host DNA is protected by other enzymes called methylases, which meth- 
ylate adenine bases within host recognition sequences (Figure 9.32). For 
each restriction endonuclease, the host cell produces a corresponding 
methylase that marks the host DNA and prevents its degradation. These 
pairs of enzymes are referred to as restriction-modification systems. We shall 
return to the mechanism used to achieve the necessary leveis of specificity 
after considering the chemistry of the cleavage process. 


Cleaved Not cleaved 

5 '-GATATC — 3' 5' G/ÍTATC-3' 



FIGURE 9.32 Protection by methylatíon. The recognition sequence for EcoRV endonu¬ 
clease (left) and the sites of methylation (right) in DNA protected from the catalytic action 
of the enzyme. 
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9.3.1 Cleavage Is by In-Line Displacement of 5 ' Oxygen 
from Phosphorus by Magnesium-Activated Water 

The fundamental reaction catalyzed by restriction endonucleases is the 
hydrolysis of the phosphodiester backbone of DNA. Specifically, the bond 
between the 3' oxygen atom and the phosphorus atom is broken. The prod- 
ucts of this reaction are DNA strands with a free 3'-hydroxyl group and a 
5'-phosphoryl group (Figure 9.33). This reaction proceeds by nucleophilic 
attack at the phosphorus atom. We will consider two types of mechanism, 
as suggested by analogy with the proteases. The restriction endonuclease 
might cleave DNA in mechanism 1 through a covalent intermediate, em- 
ploying a potent nucleophile (Nu), or in mechanism 2 by direct hydrolysis: 


Mechanism Type 1 (covalent intermediate) 


\F 

+ enzyme-NuH 

R20^ ORi 


^P 

enzyme-Nu ^OR 2 + RiOH 


enzyme 



+ H 2 O 


enzyme-NuH + 



Mechanism Type 2 (direct hydrolysis) 


R20'^ ^ORi 


+ H 2 O 


RiOH + 


XF 



Each postulates a different nucleophile to carry out the attack on the 
phosphorus. In either case, each reaction takes place by an in-line displace¬ 
ment path: 


RiO^ 


Nu + 



R3O 


ORi 

Nu P L 
R2O bRs 


ORi 


N 


+ L 


^0R3 


OR2 


The incoming nucleophile attacks the phosphorus atom, and a pentacoordi- 
nate transition state is formed. This species has a trigonal bipyramidal geom- 
etry centered at the phosphorus atom, with the incoming nucleophile at one 




FIGURE 9.33 Hydrolysis of a phosphodiester bond. All restriction enzymes catalyze 
the hydrolysis of DNA phosphodiester bonds, leaving a phosphoryl group attached to the 5' 
end. The bond that is cleaved is shown in red. 
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apex of the two pyramids and the group that is displaced 
(the leaving group, l) at the other apex. The two mech- 
anisms differ in the number of times the displacement 
occurs in the course of the reaction. 5 ' 

In the first type of mechanism, a nucleophile in the 
enzyme (analogous to serine 195 in chymotrypsin) at- 
tacks the phosphoryl group to form a covalent interme- 3 ^^ 
diate. In a second step, this intermediate is hydrolyzed 
to produce the final products. Because two displacement 
reactions take place at the phosphorus atom in the first 
mechanism, the stereochemical configuration at the 
phosphorus atom would be inverted and then inverted . 

again, and the overall configuration would be retained. X = 

In the second type of mechanism, analogous to that used T A 
by the aspartyl and metalloproteases, an activated water 
molecule attacks the phosphorus atom directly. In this 
mechanism, a single displacement reaction takes place at the phosphorus 
atom. Hence, the stereochemical configuration of the tetrahedral phospho¬ 
rus atom is inverted each time a displacement reaction takes place. Moni- 
toring the stereochemical changes of the phosphorus could be one approach 
to determining the mechanism of restriction endonuclease action. 

A difficulty is that the phosphorus centers in DNA are not chiral, be¬ 
cause two of the groups bound to the phosphorus atom are simple oxygen 
atoms, identical with each other. This difficulty can be circumvented by 
preparing DNA molecules that contain chiral phosphoryl groups, made 
by replacing one oxygen atom with sulfur (called a phosphorothioate). 
Let us consider EcoBM endonuclease. This enzyme cleaves the phospho- 
diester bond between the T and the A at the center of the recognition se- 
quence 5'-GATATC-3'. The first step in monitoring the activity of the 
enzyme is to synthesize an appropriate substrate for TcoRV containing 
phosphorothioates at the sites of cleavage (Figure 9.34). The reaction is 
then performed in water that has been greatly enriched in to allow 
the incoming oxygen atom to be marked. The location of the labei 
with respect to the sulfur atom indicates whether the reaction proceeds with 
inversion or retention of stereochemistry. The analysis revealed that the 
stereochemical configuration at the phosphorus atom was inverted only once 
with cleavage. This result is consistent with a direct attack of water at phos¬ 
phorus and rules out the formation of any covalently bound intermediate 
(Figure 9.35). 
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FIGURE 9.34 Labeling with phospho¬ 
rothioates. Phosphorothioates, groups 
in which one of the nonbridging oxygen 
atoms is replaced with a sulfur atom, 
can be used to labei specific sites in the 
DNA backbone to determine the overall 
stereochemical course of a displacement 
reaction. Here, a phosphorothioate is 
placed at sites that can be cleaved by 
EcoRV endonuclease. 



Inverted Not inverted 

(not observed) 


FIGURE 9.35 Stereochemistry of cleaved DNA. Cleavage of DNA by EcoRV endonucle¬ 
ase resuits in overall inversion of the stereochemical configuration at the phosphorus atom, 
as indicated by the stereochemistry of the phosphorus atom bound to one bridging oxygen 
atom, one one and one sulfur atom. This configuration strongly suggests that the 
hydrolysis takes place by the direct attack of water on the phosphorus atom. 
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FIGURE 9.36 Magnesíum ion bíndíng 
site in EcoRV endonuclease. The 

magnesium ion helps to activate a water 
molecule and positions it so that it can 
attack the phosphate. 


9.3.2 Restriction Enzymes Require Magnesium 
for Catalytic Activity 

Restriction endonucleases as well as many other enzymes that act on phos- 
phate-containing substrates require or some other similar divalent 

cation for activity. What is the function of this metal? 

It has been possible to examine the interactions of the magnesium ion 
when it is bound to the enzyme. Crystals have been produced of £coRV en¬ 
donuclease bound to oligonucleotides that contain the appropriate recogni- 
tion sequences. These crystals are grown in the absence of magnesium to 
prevent cleavage; then, when produced, the crystals are soaked in Solutions 
containing the metal. No cleavage takes place, allowing the location of the 
magnesium ion binding sites to be determined (Figure 9.36). The magne¬ 
sium ion was found to be bound to six ligands: three are water molecules, 
two are carboxylates of the enzyme s aspartate residues, and one is an oxy- 
gen atom of the phosphoryl group at the site of cleavage. The magnesium 
ion holds a water molecule in a position from which the water molecule can 
attack the phosphoryl group and, in conjunction with the aspartate residues, 
helps polarize the water molecule toward deprotonation. Because cleavage 
does not take place within these crystals, the observed structure cannot be 
the true catalytic conformation. Additional studies have revealed that a sec- 
ond magnesium ion must be present in an adjacent site for FcoRV endonu¬ 
clease to cleave its substrate. 



Asp 74 


Adenine 


Thymine 


Scissile bond 


Asp 90 


9.3.3 The Complete Catalytic Apparatus Is Assembled 
Only Within Complexes of Cognate DNA Molecules, 

Ensuring Specificity 

We now return to the question of specificity, the defining feature of re¬ 
striction enzymes. The recognition sequences for most restriction endonu¬ 
cleases are inverted repeats. This arrangement gives the three-dimensional 
structure of the recognition site a twofold rotational symmetry (Figure 9.37). 
The restriction enzymes display a corresponding symmetry to facilitate 
recognition: they are dimers whose two subunits are related by twofold ro¬ 
tational symmetry. The matching symmetry of the recognition sequence and 
the enzyme has been confirmed by the determination of the structure of the 
complex between FcoRV endonuclease and DNA fragments containing its 
recognition sequence (Figure 9.38). The enzyme surrounds the DNA in a 
tight embrace. Examination of this structure reveals features that are highly 
significant in determining specificity. 
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FIGURE 9.37 Structure of the 
recognition site of EcoRV endonucle- 
ase. (A) The sequence of the recognition 
site, which is symmetric around the axis 
of rotation designated in green. (B) The 
inverted repeat within the recognition 
sequence of EcoRV (and most other 
restriction endonucleases) endows the 
DNA site with twofold rotational symmetry. 


A unique set of interactions occurs between the enzyme and a cognate 
DNA sequence. Within the 5'-GATATC-3' sequence, the G and A bases 
at the 5' end of each strand and their Watson-Crick partners directly con- 
tact the enzyme by hydrogen bonding with residues that are located in two 
loops, one projecting from the surface of each enzyme subunit (Figure 9.39). 
The most striking feature of this complex is the distortion of the DNA, which 
is substantially kinked in the center (Figure 9.40). The central two TA base 
pairs in the recognition sequence play a key role in producing the kink. They 
do not make contact with the enzyme but appear to be required because of 
their ease of distortion. 5' -TA-3' sequences are known to be among the most 
easily deformed base pairs. The distortion of the DNA at this site has se- 
vere effects on the specificity of enzyme action. 

Specificity is often determined by an enzyme s binding affinity for sub- 
strates. In regard to FcoRV endonuclease, however, binding studies per- 
formed in the absence of magnesium have demonstrated that the enzyme 
binds to all sequences, both cognate and noncognate, with approximately 
equal affinity. However, the structures of complexes formed with noncog¬ 
nate DNA fragments are strikingly different from those formed with cog¬ 
nate DNA: the noncognate DNA conformation is not substantially distorted 
(Figure 9.41). This lack of distortion has important consequences with regard 
to catalysis. No phosphate is positioned sufficiently close to the active-site 



% FIGURE 9.38 Structure of the 
EcoRV-cognate DNA complex. This 
view of the structure of EcoRV endonucle¬ 
ase bound to a cognate DNA fragment is 
down the helical axis of the DNA. The two 
protein subunits are in yellow and blue, 
and the DNA backbone is in red. The 
twofold axes of the enzyme dimer and the 
DNA are aligned. 









(A) 



FIGURE 9.39 Hydrogen bondíng interactíons between EcoRV en- 
donuclease and íts DNA substrate. One of the DNA-binding loops (ín green) 
of EcoRV endonuclease is shown interacting with the base pairs of its cognate 
DNA binding site. Key amino add residues are shown hydrogen bonding with 
(B) a CG base pair and (C) an AT base pair. 




FIGURE 9.40 Dístortion of the recog- 
nition site. The DNA is represented as a 
ball-and-stick model. The path of the DNA 
helical axis, shown in red, is substantially 
distorted on binding to the enzyme. For 
the B form of DNA, the axis is straight 
(not shown). 



aspartate residues to complete a magnesium ion binding site (see Figure 9.36). 
Hence, the nonspecific complexes do not bind the magnesium ion and the 
complete catalytic apparatus is never assembled. The distortion of the sub¬ 
strate and the subsequent binding of the magnesium ion account for the cat¬ 
alytic specificity of more than 1,000,000-fold that is observed for FcoRV en¬ 
donuclease despite very little preference at the levei of substrate binding. 

We can now see the role of binding energy in this strategy for attaining 
catalytic specificity. In binding to the enzyme, the DNA is distorted in such 
a way that additional contacts are made between the enzyme and the 
substrate, increasing the binding energy. However, this increase is canceled 
by the energetic cost of distorting the DNA from its relaxed conformation 
(Figure 9.42). Thus, for FcoRV endonuclease, there is little difference in 
binding affinity for cognate and nonspecific DNA fragments. However, 
the distortion in the cognate complex dramatically affects catalysis by 









FIGURE 9.41 Nonspecific and cognate DNA withín 
£coRV endonuclease. A comparison of the positions of the 
nonspecific (orange) and the cognate DNA (red) within EcoRV 
reveals that, in the nonspecific complex, the DNA backbone is 
too far from the enzyme to complete the magnesium ion bind- 
ing sites. 
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FIGURE 9.42 Greater bínding energy of EcoRV endonuclease 
bound to cognate versus noncognate DNA. The additional 
interactions between EcoRV endonuclease and cognate DNA 
increase the binding energy, which can be used to drive DNA 
distortions necessary for forming a catalytically competent complex. 


EcoRV 


completing the magnesium ion binding site. This example illustrates how 
enzymes can utilize available binding energy to deform substrates and poise them 
for Chemical transformation. Interactions that take place within the distorted 
substrate complex stabilize the transition state leading to DNA hydrolysis. 

The distortion in the DNA explains how methylation blocks catalysis 
and protects host-cell DNA. When a methyl group is added to the amino 
group of the adenine nucleotide at the 5' end of the recognition sequence, 
the methyl group s presence precludes the formation of a hydrogen bond be¬ 
tween the amino group and the side-chain carbonyl group of aspar agine 185 
(Figure 9.43). This asparagine residue is closely linked to the other amino 
acids that form specific contacts with the DNA. The absence of the hydro¬ 
gen bond disrupts other interactions between the enzyme and the DNA 
substrate, and the distortion necessary for cleavage will not take place. 




9.3.4 Type II Restriction Enzymes Have a Catalytic Core 
in Common and Are Probably Related by Horizontal 
Gene Transfer 

Y Type II restriction enzymes are prevalent in Archaea and Eubacte- 
ria. What can we tell of the evolutionary history of these enzymes? 
Comparison of the amino acid sequences of a variety of type II restriction 
endonucleases did not reveal significant sequence similarity between most 
pairs of enzymes. However, a careful examination of three-dimensional 
structures, taking into account the location of the active sites, revealed the 
presence of a core structure conserved in the different enzymes. This struc- 
ture includes |3 strands that contain the aspartate (or, in some cases, gluta- 
mate) residues forming the magnesium ion binding sites (Figure 9.44). 

These observations indicate that many type II restriction enzymes are in- 
deed evolutionary related. Analyses of the sequences in greater detail sug- 
gest that bactéria may have obtained genes encoding these enzymes from 


Methylated DNA 

FIGURE 9.43 Methylation of adenine. 

The methylation of adenine blocks the for¬ 
mation of hydrogen bonds between EcoRV 
endonuclease and cognate DNA molecules 
and prevents their hydrolysis. 
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FIGURE 9.44 A conserved 
structural core in type II restriction 
enzymes. Four conserved structural ele- 
ments, including the active-site region (in 
blue), are highlighted in color in these 
models of a single monomer from each 
dimeric enzyme. The positions of the 
amino acid sequences that form these ele- 
ments within each overall sequence are 
represented schematically below each 
structure. 


other species by horizontal gene transfer, the passing between species of pieces 
of DNA (such as plasmids) that provide a selective advantage in a particu¬ 
lar environment. For example, EcoKl (from E. coli) and Rsrl (from Rhodobac- 
ter sphaeroides) are 50% identical in sequence over 266 amino acids, clearly 
indicative of a dose evolutionary relationship. However, these species of bac¬ 
téria are not closely related, as is known from sequence comparisons of other 
genes and other evidence. Thus, it appears that these species obtained the gene 
for this restriction endonuclease from a common source more recently than the 
time of their evolutionary divergence. Moreover, the gene encoding EcoRl en¬ 
donuclease uses particular codons to specify given amino acids that are strik- 
ingly different from the codons used by most E. coli genes, which suggests 
that the gene did not originate in E. coli. Horizontal gene transfer may be a 
relatively common event. For example, genes that inactivate antibiotics are 
often transferred, leading to the transmission of antibiotic resistance from 
one species to another. For restriction -modification systems, protection 
against virai infections may have favored horizontal gene transfer. 


9.4 NUCLEOSIDE MONOPHOSPHATE KINASES; 
CATALYZING PHOSPHORYL GROUP EXCHANGE BETWEEN 
NUCLEOTIDES WITHOUT PROMOTING HYDROLYSIS 

The final enzymes that we shall consider are the nucleoside monophosphate 
kinases (NMP kinases), typified by adenylate kinase. These enzymes cat- 
alyze the transfer of the terminal phosphoryl group from a nucleoside 
triphosphate (NTP), usually ATP, to the phosphoryl group on a nucleoside 
monophosphate (Figure 9.45). The challenge for NMP kinases is to pro- 
mote the transfer of the phosphoryl group from NTP to NMP without pro- 
moting the competing reaction —the transfer of a phosphoryl group from 
NTP to water; that is, NTP hydrolysis. We shall see how the use of induced 
fit by these enzymes is used to solve this problem. Moreover, these enzymes 
employ metal ion catalysis; but, in this case, the metal forms a complex with 
the substrate to enhance enzyme-substrate interaction. 



FIGURE 9.45 Phosphoryl group transfer by nucleoside monophosphate kinases. 

These enzymes catalyze the interconversion of nucleoside triphosphate (here, ATP) and a 
nucleoside monophosphate (NMP) into two nucleoside diphosphates by the transfer of a 
phosphoryl group (shown in red). 





























9.4.1 NMP Kinases Are a Family of Enzymes Containing I 

P-Loop Structures NMP Kinases 

X-ray crystallographic methods have yielded the three-dimensional struc¬ 
tures of a number of different NMP kinases, both free and bound to sub- 
strates or substrate analogs. Comparison of these structures reveals that these 
enzymes form a family of homologous proteins (Figure 9.46). In particular, 
such comparisons reveal the presence of a conserved NTP-binding domain. 

This domain consists of a central |3 sheet, surrounded on both sides by a he- 
lices (Figure 9.47). A characteristic feature of this domain is a loop between 
the first |3 strand and the first helix. This loop, which typically has an amino 
acid sequence of the form Gly-X-X-X-X-Gly-Lys, is often referred to as the 
P-loop because it interacts with phosphoryl groups on the bound nucleotide 
(Figure 9.48). As described in Section 9.4.4, similar domains containing P- 
loops are present in a wide variety of important nucleotide-binding proteins. 





FIGURE 9.46 Structures of 
adenylate kínase and guanylate 
kinase. The nucleoside triphosphate- 
binding domain is a common feature in 
these and other homologous nucleotide 
kinases. The domain consists of a central 
p-pleated sheet surrounded on both sides 
by a helices (highlighted in purple) as 
well as a key loop (shown in green). 


Adenylate kinase 



FIGURE 9.47 The core domain 
of NMP kinases. The P-loop is shown in 
green. The dashed lines represent the re- 
mainder of the protein structure. 


FIGURE 9.48 P-loop interaction with ATP. The P-loop of adenylate kinase interacts 
with the phosphoryl groups of ATP (shown with dark bonds). Hydrogen bonds (green) 
link ATP to peptide NH groups as well as a lysine residue conserved among NMP kinases. 
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NTP-Dependent Enzymes 

Kinetic studies of NMP kinases, as well as many other enzymes having ATP 
or other nucleoside tríphosphates as a substrate, reveal that these enzymes 
are essentially inactive in the absence of divalent metal ions such as mag¬ 
nesium (Mg^^) or manganese (Mn^^), but acquire activity on the addition 
of these ions. In contrast with the enzymes discussed so far, the metal is not 
a component of the active site. Rather, nucleotides such as ATP bind these 
ions, and it is the metal ion-nucleotide complex that is the true substrate for 
the enzymes. The dissociation constant for the ATP-Mg^^ complex is ap- 
proximately 0.1 mM, and thus, given that intracellular Mg^^ concentra- 
tions are typically in the millimolar range, essentially all nucleoside triphos- 
phates are present as NTP-Mg^^ complexes. 

How does the binding of the magnesium ion to the nucleotide affect 
catalysis? There are a number of related consequences, but all serve to en- 
hance the specificity of the enzyme-substrate interactions by enhancing 
binding energy. First, the magnesium ion neutralizes some of the negative 
charges present on the polyphosphate chain, reducing nonspecific ionic in¬ 
teractions between the enzyme and the polyphosphate group of the nu¬ 
cleotide. Second, the interactions between the magnesium ion and the oxy- 
gen atoms in the phosphoryl group hold the nucleotide in well-defined 
conformations that can be specifically bound by the enzyme (Figure 9.49). 
Magnesium ions are typically coordinated to six groups in an octahedral 
arrangement. Typically, two oxygen atoms are directly coordinated to a 
magnesium ion, with the remaining coordination positions often occupied 



FIGURE 9.49 The structures of two ísomeríc forms of the ATP-Mg^^ com¬ 
plex. Other groups coordinated to the magnesium ion have been omitted for clarity. 


FIGURE 9.50 ATP-IVIg^+ complex 
bound to adenylate kínase. The 

magnesium ion is bound to the p and 7 
phosphoryl groups, and the four water 
molecules bound to the remaining 
coordination positions interact with groups 
on the enzyme, including a conserved 
aspartate residue. Other interactions have 
been omitted for clarity. 



Magnesium-ATP complex 






by water molecules. Oxygen atoms of the a and |3, |3 and 7 , or a and 7 phos- 
phoryl groups may contribute, depending on the particular enzyme. In ad- 
dition, different stereoisomers are produced, depending on exactly which 
oxygen atoms bind to the metal ion. Third, the magnesium ion provides ad- 
ditional points of interaction between the ATP-Mg^^ complex and the en¬ 
zyme, thus increasing the binding energy. In some cases, such as the DNA 
polymerases (Section 27.x.x), side chains (often aspartate and glutamate 
residues) of the enzyme can bind directly to the magnesium ion. In other 
cases, the enzyme interacts indirectly with the magnesium ion through hy- 
drogen bonds to the coordinated water molecules (Figure 9.50). Such in- 
teractions have been observed in adenylate kinases bound to ATP analogs. 

9.4.3 ATP Binding Induces Large Conformational Changes 

Comparison of the structure of adenylate kinase in the presence and absence 
of an ATP analog reveals that substrate binding induces large structural 
changes in the kinase, providing a classic example of the use of induced fit 
(Figure 9.51). The P-loop doses down on top of the polyphosphate chain, 
interacting most extensively with the |3 phosphoryl group. The movement 
of the P-loop permits the top domain of the enzyme to move down to form 
a lid o ver the bound nucleotide. This motion is favored by interactions be¬ 
tween basic residues (conserved among the NMP kinases), the peptide back¬ 
bone NH groups, and the nucleotide. With the ATP nucleotide held in this 
position, its 7 phosphoryl group is positioned next to the binding site for 
the second substrate, NMP. In sum, the direct interactions with the nu¬ 
cleotide substrate lead to local structural rearrangements (movement of the 
P-loop) within the enzyme, which in turn allow more extensive changes (the 
closing down of the top domain) to take place. The binding of the second 
substrate, NMP, induces additional conformational changes. Both sets of 
changes ensure that a catalytically competent conformation is formed only 
when both the donor and the acceptor are bound, preventing wasteful trans- 
fer of the phosphoryl group to water. The enzyme holds its two substrates 
dose together and appropriately oriented to stabilize the transition state that 
leads to the transfer of a phosphoryl group from the ATP to the NMP. This 
is an example of catalysis by approximation. We will see such examples of a 
catalytically competent active site being generated only on substrate bind¬ 
ing many times in our study of biochemistry. 

9.4.4 P-Loop NTPase Domains Are Present in a 
Range of Important Proteins 

Y Domains similar (and almost certainly homologous) to those found 
in NMP kinases are present in a remarkably wide array of proteins, 
many of which participate in essential biochemical processes. Examples in- 
clude ATP synthase, the key enzyme responsible for ATP generation; mo¬ 
lecular motor proteins such as myosin; signal-transduction proteins such as 
transducin; proteins essential for translating mRNA into proteins, such as 
elongation factor Tu; and DNA and RN A unwinding helicases. The wide 
utility of P-loop NTPase domains is perhaps best explained by their abil- 
ity to undergo substantial conformational changes on nucleoside triphos- 
phate binding and hydrolysis. We shall encounter these domains (hereafter 
referred to as P-loop NTPases) throughout the book and shall observe how 
they function as springs, motors, and clocks. To allow easy recognition of 
these domains, they, like the binding domains of the NMP kinases, will be 
depicted with the inner surfaces of the ribbons in a ribbon diagram shown 
in purple and the P-loop shown in green (Figure 9.52). 





% FIGURE 9.51 Conformational 
changes in adenylate kinase. Large 
conformational changes are associated 
with the binding of ATP by adenylate ki¬ 
nase. The P-loop is shown in green in 
each structure. The lid domain is high- 
lighted in yellow. 
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a subunit of transducin 


FIGURE 9.52 Three proteíns 
contaíníng P-loop NTPase domains. 

For the conserved domain, the inner sur- 
faces of the ribbons are purple and the 
P-loops are green. 



P subunit of ATP synthase 


SUMMARY 


Enzymes adopt conformations that are structurally and chemically 
complementary to the transition States of the reactions that they cat- 
alyze. Sets of interacting amino acid residues make up sites with the spe- 
cial structural and chemical properties necessary to stabilize the transi¬ 
tion State. Enzymes use five basic strategies to form and stabilize the 
transition state: (1) the use of binding energy, (2) covalent catalysis, (3) 
general acid-base catalysis, (4) metal ion catalysis, and (5) catalysis by 
approximation. Of the enzymes examined in this chapter, three groups 
of enzymes catalyze the addition of water to their substrates but have 
different requirements for catalytic speed and specificity, and a fourth 
group of enzymes must prevent reaction with water. 

Proteases: Facilitating a Difficult Reaction 

The cleavage of peptide bonds by chymotrypsin is initiated by the attack 
of a serine residue on the peptide carbonyl group. The attacking hydroxyl 
group is activated by interaction with the imidazole group of a histidine 
residue, which is, in turn, linked to an aspartate residue. This Ser-His- 
Asp catalytic triad generates a powerful nucleophile. The product of this 







initial reaction is a covalent intermediate formed by the enzyme and an 
acyl group derived from the bound substrate. The hydrolysis of this acyl- 
enzyme intermediate completes the cleavage process. The tetrahedral in- 
termediates for these reactions have a negative charge on the peptide car- 
bonyl oxygen atom. This negative charge is stabilized by interactions with 
peptide NH groups in a region on the enzyme termed the oxyanion hole. 

Other proteases employ the same catalytic strategy. Some of these pro- 
teases, such as trypsin and elastase, are homologs of chymotrypsin. In other 
proteases, such as subtilisin, a very similar catalytic triad has arisen by con- 
vergent evolution. Active-site structures that differ from the catalytic triad 
are present in a number of other classes of proteases. These classes employ 
a range of catalytic strategies but, in each case, a nucleophile is generated 
that is sufficiently powerful to attack the peptide carbonyl group. In some 
enzymes, the nucleophile is derived from a side chain; whereas, in others, 
an activated water molecule attacks the peptide carbonyl directly. 

• Carbonic Anhydrases: Making a Fast Reaction Faster 

Carbonic anhydrases catalyze the reaction of water with carbon dioxide 
to generate carbonic acid. The catalysis can be extremely fast: molecules 
of some carbonic anhydrases hydrate carbon dioxide at rates as high as 
1 million times per second. A tightly bound zinc ion is a crucial com- 
ponent of the active sites of these enzymes. Each zinc ion binds a wa¬ 
ter molecule and promotes its deprotonation to generate a hydroxide ion 
at neutral pH. This hydroxide attacks carbon dioxide to form bicar¬ 
bonato ion, HC 03 ~. Because of the physiological roles of carbon diox¬ 
ide and bicarbonato ions, speed is of the essence for this enzyme. To 
overcome limitations imposed by the rate of proton transfer from the 
zinc-bound water molecule, the most active carbonic anhydrases have 
evolved a proton shuttle to transfer protons to a buffer. 

• Restriction Enzymes: Performing Highly Specific DNA Cleavage Reactions 

A high levei of substrate specificity is often the key to biological func- 
tion. Restriction endonucleases that cleave DNA at specific recognition 
sequences discriminate between molecules that contain these recogni¬ 
tion sequences and those that do not. Within the enzyme-substrate 
complex, the DNA substrate is distorted in a manner that generates a 
magnesium ion binding site between the enzyme and DNA. The mag- 
nesium ion binds and activates a water molecule, which attacks the 
phosphodiester backbone. 

Some enzymes discriminate between potential substratos by binding 
them with different affinities. Others may bind many potential substratos 
but promote chemical reactions efficiently only on specific molecules. 
Restriction endonucleases such as EcoRV endonuclease employ the lat- 
ter mechanism to achieve leveis of discrimination as high as million-fold. 
Structural studies reveal that these enzymes may bind nonspecific DNA 
molecules, but such molecules are not distorted in a manner that allows 
magnesium ion binding and, hence, catalysis. Restriction enzymes are 
prevented from acting on the DNA of a host cell by the methylation of 
key sites within their recognition sequences. The added methyl groups 
block specific interactions between the enzymes and the DNA such that 
the distortion necessary for cleavage does not take place. 

• Nucleoside Monophosphate Kinases: Catalyzing Phosphoryl Group 
Exchange Without Promoting Hydrolysis 

Finally, NMP kinases illustrate that induced fit—the alteration of en¬ 
zyme structure on substrate binding—facilitates phosphoryl transfer 
between nucleotides rather than to a molecule of water. This class of 
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enzyme displays a structural motif called the P-loop NTPase domain that 
is present in a wide array of nucleotide-binding proteins. The closing of 
the P-loop over a bound nucleoside triphosphate substrate permits the 
top domain of the enzyme to form a lid over the bound nucleotide, posi- 
tioning the triphosphate near the monophosphate with which it will re- 
act, in an example of catalysis by approximation. These enzymes are de- 
pendent on metal ions, but the ions bind to substrate instead of directly 
to the enzyme. The binding of the metal ion to the nucleoside triphos¬ 
phate enhances the specificity of the enzyme-substrate interactions by 
holding the nucleotide in a well-defined conformation and providing ad- 
ditional points of interaction, thus increasing binding energy. 


-1 KEY TERMS |— 

binding energy (p. 9-2) 
induced fit (p. 9-2) 
covalent catalysis (p. 9-2) 
general acid-base catalysis (p. 9-2) 
metal ion catalysis (p. 9-2) 
catalysis by approximation (p. 9-2) 


Chemical modification reaction (p. 9-4) 

catalytic triad (p. 9-5) 

oxyanion hole (p. 9-7) 

protease inhibitor (p. 9-12) 

proton shuttle (p. 9-17) 

recognition sequence (p. 9-19) 


restriction- modification system (p. 9-19) 
in-line displacement (p. 9-20) 
horizontal gene transfer (p. 9-26) 

P-loop (p. 9-27) 
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1. No burst. Examination of the cleavage of the chromogenic 
amide substrate, A, by chymotrypsin with the use of stopped- 
flow kinetic methods reveals no burst. Why? 



2. Contributing to your own demise. Consider the subtilisin sub¬ 
strates A and B. 

Phe-Ala-Gin-Phe-X Phe-Ala-His-Phe-X 

A B 

These substrates are cleaved (between Phe and X) by native sub¬ 
tilisin at essentially the same rate. However, the His 64-to-Ala 
mutant of subtilisin cleaves substrate B more than 1000-fold as 
rapidly as it cleaves substrate A. Propose an explanation. 

3. ] + ] 7 ^ 2. Consider the following argument. In subtilisin, 
mutation of Ser 221 to Ala results in a 10^-fold decrease in ac- 
tivity. Mutation of His 64 to Ala results in a similar 10^-fold de¬ 


crease. Therefore, simultaneous mutation of Ser 221 to Ala and 
His 64 to Ala should result in a 10^ X 10^ = 10^^-fold reduction 
in activity. Is this correct? Why or why not? 

4. Adding a charge. In chymotrypsin, a mutant was construeted 
with Ser 189, which is in the bottom of the substrate specificity 
pocket, changed to Asp. What effect would you predict for this 
Ser 189^ Asp 189 mutation? 

5. Conditional results. In carbonic anhydrase II, mutation of the 
proton-shuttie residue His 64 to Ala was expected to result in a 
decrease in the maximal catalytic rate. However, in buffers such 
as imidazole with relatively small molecular components, no rate 
reduction was observed. In buffers with larger molecular com¬ 
ponents, significant rate reduetions were observed. Propose an 
explanation. 

6. How many sites? A researcher has isolated a restriction en¬ 
donuclease that cleaves at only one particular 10-base-pair site. 
Would this enzyme be useful in protecting cells from virai in- 
fections, given that a typicai virai genome is 50,000 base pairs 
long? Explain. 

7. Is faster better? Restriction endonucleases are, in general, 
quite slow enzymes with ty picai turno ver numbers of 1 s~^. Sup- 
pose that endonucleases were faster with turnover numbers sim¬ 
ilar to those for carbonic anhydrase (10^ s~^). Would this in- 
creased rate be beneficiai to host cells, assuming that the fast 
enzymes have similar leveis of specificity? 
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8 . Adopting a new gene. Suppose that one species of bactéria 
obtained one gene encoding a restriction endonuclease by hori¬ 
zontal gene transfer. Would you expect this acquisition to be 
beneficiai? 

9. Predict the product. Adenylate kinase is treated with adeno- 
sine disphosphate (ADP). 

(a) What Products will be generated? 

(b) If the initial concentration of ADP is 1 mM, estimate the 
concentrations of ADP and the products from part a after incu- 
bation with adenylate kinase for a long time. 

10 . Chelation therapy. Treatment of carbonic anhydrase with 
high concentrations of the metal chelator EDTA (ethylenedi- 
aminetetraacetic acid) results in the loss of enzyme activity. Pro- 
pose an explanation. 

11 . Identify the enzyme. Consider the structure of molecule A. 
Which enzyme discussed in this chapter do you think molecule 
A will mo st effectively inhibit? 



Molecule A 


12 . An aldehyde inhibitor. Elastase is specifically inhibited by an 
aldehyde derivative of one of its substrates: 


HsQ 


A/-Acetyl-Pro-Ala-Pro— 

H 


O 


(a) Which residue in the active site of elastase is most likely to 
form a covalent bond with this aldehyde? 

(b) What type of covalent link would be formed? 

Mechanism Problem 

13. Complete the mechanism. On the basis of the Information 
provided in Figure 9.18, complete the mechanisms for peptide- 
bond cleavage by (a) a cysteine protease, (b) an aspartyl protease, 
and (c) a metalloprotease. 

Media Problems 

14. Nou) you see it, now you dont. Pre-steady-State experiments 
using chymotrypsin and a chromogenic substrate (N-acetyl- 
L-phenylalaninep-nitrophenyl ester) show a ‘‘burst” of product 
at very short times (Figure 9.4). The Conceptual Insights 
module on enzyme kinetics explains this result. What results 
would you see if the product detected by the assay was the 
free N-terminal component of the substrate instead of the 
C-terminal component? (Hint: Use the pre-steady-State reaction 
simulation to simulate the experiment. Select different times fol- 
lowing mixing and observe the amount of each product.). 

15. Seeing is dishelieving. DIPF reacts specifically with serine 195 
of chymotrypsin. One hypothesis as to why this is so might be that 
serine 195 is unusually exposed on the surface of the protein com- 
pared to other serines. After looking at the Structural Insights 
module on chymotrypsin, what do you think of this hypothesis? 



Like motor traffic, metabolic pathways flow more efficientiy 
when regulated by signals. CTP, the final product of a multístep 
pathway, Controls flux through the pathway by inhibiting the com- 
mitted step catalyzed by aspartate transcarbamoylase (ATCase). 
[(Left) Richard Berenholtz/The Stock Market] 


Regulatory Strategies: 
and Hemoglobin 


Enzymes 


ATCase 


T 

■ CTP 


The activity of proteins, including enzymes, often must be regulated so that 
they function at the proper time and place. The biological activity of pro¬ 
teins is regulated in four principal ways: 


1 . Allosteric control. Allosteric proteins contain distinct regulatory 
multiple functional sites. Regulation by small signal molecules is 
cant means of controlling the activity of many proteins. The 
binding of these regulatory molecules at sites distinct from 
the active site triggers conformational changes that are 
transmitted to the active site. Moreover, allosteric proteins 
show the property of cooperativity: activity at one functional 
site affects the activity at others. As a consequence, a slight 
change in substrate concentration can produce substantial 
changes in activity. Proteins displaying allosteric control are 
thus Information transducers: their activity can be modi- 
fied in response to signal molecules or to Information shared 
among active sites. This chapter examines two of the best- 
understood allosteric proteins: the enzyme aspartate trans¬ 
carbamoylase (ATCase) and the oxygen-carrying protein 
hemoglobin. Catalysis by aspartate transcarbamoylase of the 
first step in pyrimidine biosynthesis is inhibited by cytidine 
triphosphate, the final product of that biosynthesis, in an 
example of feedback inhibition. The binding of O 2 by he¬ 
moglobin is cooperative and is regulated by H^, CO 2 and 
2 ,3-bisphosphoglycerate (2,3-BPG). 


sites and 
a signifi- 


OUTLINE 


10.1 Aspartate Transcarbamoylase Is 
Allosterically Inhibited by the End 
Product of Its Pathway 

10.2 Hemoglobin Transports Oxygen 
Efficientiy by Binding Oxygen 
Cooperatively 

10.3 Isozymes Provide a Means 
of Regulation Specific to Distinct 
Tissues and Developmental Stages 

10.4 Covalent Modification Is a Means 
of Regulating Enzyme Activity 

10.5 Many Enzymes Are Activated 
by Specific Proteolytic Cleavage 
















-1 262 I- 

CHAPTER 10 • Regulatory Strategies: 

Enzymes and Hemoglobin 


FIGURE 10.1 ATCase reaction. 

Aspartate transcarbamoylase catalyzes 
the committed step, the condensation of 
aspartate and carbamoyl phosphate to 
form A/-carbannoylaspartate, in pyrimidine 
synthesis. 


2 . Multiple forms of enzymes. Isozymes, or isoenzymes, provide an avenue 
for varying regulation of the same reaction at distinct locations or times. 
Isozymes are homologous enzymes within a single organism that catalyze 
the same reaction but differ slightly in structure and more obviously in Km 
and Vmax values, as well as regulatory properties. Often, isozymes are ex- 
pressed in a distinct tissue or organelle or at a distinct stage of development. 

3. Reversible covalent modification. The catalytic properties of many enzymes 
are markedly altered by the covalent attachment of a modifying group, most 
commonly a phosphoryl group. ATP serves as the phosphoryl donor in these 
reactions, which are catalyzed by protein kinases. The removal of phospho¬ 
ryl groups by hydrolysis is catalyzed by protein phosphatases. This chapter 
considers the structure, specificity, and control oíprotein kinase A (PKA), a 
ubiquitous eukaryotic enzyme that regulates diverse target proteins. 

4. Proteolytic activation. The enzymes controlled by some of these mecha- 
nisms cycle between active and inactive States. A different regulatory mo- 
tif is used to irreversibly convert an inactive enzyme into an active one. Many 
enzymes are activated by the hydrolysis of a few or even one peptide bond 
in inactive precursors called zymogens or proenzymes. This regulatory mech- 
anism generates digestive enzymes such as chymotrypsin, trypsin, and 
pepsin. Caspases, which are proteolytic enzymes that are the executioners 
inprogrammed cell death, or apoptosis (Section 2.4.3), are proteolytically ac¬ 
tivated from the procaspase form. Blood clotting is due to a remarkable Cas¬ 
cade of zymogen activations. Active digestive and clotting enzymes are 
switched off by the irreversible binding of specific inhibitory proteins that 
are irresistible lures to their molecular prey. 

To begin, we will consider the principies of allostery by examining two 
proteins: the enzyme aspartate transcarbamoylase and the oxygen-trans- 
porting protein hemoglobin. 


10.1 ASPARTATE TRANSCARBAIVIOYLASE IS 
ALLOSTERICALLY INHIBITED BY THE END 
PRODUCT OF ITS PATHWAY 

Aspartate transcarbamoylase catalyzes the first step in the biosynthesis of 
pyrimidines, bases that are components of nucleic acids. The reaction cat¬ 
alyzed by this enzyme is the condensation of aspartate and carbamoyl phos¬ 
phate to form N-carbamoylaspartate and orthophosphate (Figure 10.1). 
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ATCase catalyzes the committed step in the pathway that will ultimately 
yield pyrimidine nucleotides such as cytidine triphosphate (CTP). How is 
this enzyme regulated to generate precisely the amount of CTP needed by 
the cell? 

John Gerhart and Arthur Pardee found that ATCase is inhibited by 
CTP, the final product of the ATCase-controlled pathway. The rate of the 
reaction catalyzed by ATCase is fast in the absence of high concentrations 
of CTP but decreases as the CTP concentration increases (Figure 10.2). 
Thus, more molecules are sent along the pathway to make new pyrimidines 
until sufficient quantities of CTP have accumulated. The effect of CTP on 
the enzyme exemplifies the feedback, or end-product, inhibition mentioned 
earlier. Despite the fact that end-product regulation makes considerable 
physiological sense, the observation that ATCase is inhibited by CTP is re- 
markable because CTP is structurally quite different from the substrates of 
the reaction (see Figure 10.1). Owing to this structural dissimilarity, CTP 
must bind to a site distinct from the active site where substrate binds. Such 
sites are called allosteric (from the Greek allos, ‘‘other,’' and stereos, ‘‘struc- 
ture’') or regulatory sites. CTP is an example of an allosteric inhibitor. In 
ATCase (but not all allosterically regulated enzymes), the catalytic sites and 
the regulatory sites are on separate polypeptide chains. 

10.1.1 ACTase Consists of Separable Catalytic 
and Regulatory Subunits 

What is the evidence that ATCase has distinct regulatory and catalytic sites? 
ATCase can be literally separated into regulatory and catalytic subunits by 
treatment with a mercurial compound such as p-hydroxymercuribenzoate, 
which reacts with sulfhydryl groups (Figure 10.3). The results of ultracen- 
trifugation studies carried out by Gerhart and Howard Schachman showed 
that p-hydroxymercuribenzoate dissociates ATCase into two kinds of sub¬ 
units (Figure 10.4). The sedimentation coefficient of the native enzyme is 
11 .6S, whereas those of the dissociated subunits are 2.8S and 5.8S, indicat- 
ing subunits of different size. The subunits can be readily separated by ion- 
exchange chromatography because they differ markedly in charge (Section 
4.1.3) or by centrifugation in a sucrose density gradient because they differ 
in size (Section 4.1.6). Furthermore, the attachedp-mercuribenzoate groups 
can be removed from the separated subunits by adding an excess of mer- 
captoethanol. The isolated subunits provide materiais that can be used to 
investigate and characterize the individual subunits and their interactions 
with one another. 

The larger subunit is called the catalytic (or c) subunit. This subunit dis- 
plays catalytic activity, but it is not affected by CTP. The isolated smaller 
subunit can bind CTP, but has no catalytic activity. Hence, that subunit is 
called the regulatory (or r) subunit. The catalytic subunit, which consists of 
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FIGURE 10.2 CTP inhibits ATCase. 

Cytidine triphosphate, an end product of 
the pyrimidine synthesis pathway, inhibits 
aspartate transcarbamoylase despite having 
littie structural similarity to reactants or 
Products. 
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FIGURE 10.3 Modifícatíon of cysteíne 
residues. p-Hydroxymercuribenzoate reacts 
with crucial cysteine residues in aspartate 
transcarbamoylase. 


FIGURE 10.4 Ultracentrífugation 
studies of ATCase. Sedimentation 
velocity patterns of (A) native ATCase 
and (B) the enzyme after treatment with 
p-hydroxymercuribenzoate show that the 
enzyme can be dissociated into regulatory 
and catalytic subunits. [After J. C. Gerhart and 
H. K. Schachman. Biochemistry 4(1965); 1054.] 
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three chains (34 kd each), is referred to as C 3 . The regulatory subunit, which 
consists of two chains (17 kd each), is referred to as r 2 . The catalytic and 
regulatory subunits combine rapidly when they are mixed. The resulting 
complex has the same structure, Cere, as the native enzyme: two catalytic 
trimers and three regulatory dimers. 

2 C 3 + 3 r 2 -> c^r^ 

Furthermore, the reconstituted enzyme has the same allosteric properties 
as the native enzyme. Thus, ATCase is composed of discrete catalytic and 
regulatory subunits, which interact in the native enzyme to produce its al¬ 
losteric behavior. 


10.1.2 Allosteric Interactions in ATCase Are Mediated 
by Large Changes in Quaternary Structure 

How can the binding of CTP to a regulatory subunit influence reactions at 
the active site of a catalytic subunit? Significant clues have been provided by 
the determination of the three-dimensional structure of ATCase in various 
forms by x-ray crystallography in the laboratory of William Lipscomb. The 
structure of the enzyme without any ligands bound to it confirms the over- 
all structure of the enzyme. Two catalytic trimers are stacked one on top of 
the other, linked by three dimers of the regulatory chains (Figure 10.5). There 
are significant contacts between the two catalytic trimers: each r chain within 
a regulatory dimer interacts with a c chain within a catalytic trimer through 
a structural domain stabilized by a zinc ion bound to four cysteine residues. 


FIGURE 10.5 Structure of ATCase. 

(A) The quaternary structure of aspartate 
transcarbamoylase as viewed from the top. 
The schematic drawing at the right is a 
simplified representation of the relationships 
between subunits. A single trimer [catalytic 
(c) chains, shown in orange and yellow] is 
visible; in this view, the second trimer is 
hidden behind the one visible. (B) A side 
view of the complex. 
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FIGURE 10.6 PALA, a bísubstrate 
analog. (Top) Nucleophilic attack by the 
amino group of aspartate on the carbonyl 
carbon atom of carbamoyl phosphate 
generates an intermediate on the pathway 
to the formation of A/-carbamoylaspartate. 
(Bottom) A/-(Phosphonacetyl)-L-aspartate 
(PALA) is an analog of the reaction 
intermediate and a potent competitive 
inhibitor of aspartate transcarbamoylase. 


The ability of p-hydroxymercuribenzoate to dissociate the catalytic and reg- 
ulatory subunits is related to the ability of mercury to bind strongly to the 
cysteine residues, displacing the zinc and destabilizing this domain. 

To understand the mechanism of allosteric regulation, it is crucial to lo- 
cate each active site and each regulatory site in the three-dimensional struc- 
ture. To locate the active sites, the enzyme was crystallized in the presence 
of ]V-(phosphonacetyl)-L-aspartate (PALA), a bisubstrate analog (an ana¬ 
log of the two substrates) that resembles an intermediate along the path¬ 
way of catalysis (Figure 10.6). PALA is a potent competitive inhibitor of 
ATCase; it binds to and blocks the active sites. The structure of the 
ATCase-PALA complex reveals that PALA binds at sites lying at the 
boundaries between pairs of c chains within a catalytic trimer (Figure 10.7). 
Note that, though most of the residues belong to one subunit, several key 
residues belong to a neighboring subunit. Thus, because the active sites are 
at the subunit interface, each catalytic trimer contributes three active sites 
to the complete enzyme. Suitable amino acid residues are available in the 
active sites for recognizing all features of the bisubstrate analog, including 
the phosphate and both carboxylate groups. 


FIGURE 10.7 The active site of ATCase. 

Some of the crucial active-site residues are 
shown binding to the inhibitor PALA. The 
active site is composed mainly of residues 
from one subunit, but an adjacent subunit 
aiso contributes important residues (boxed 
in green). 
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FIGURE 10.8 The T-to-R State transítion 
in ATCase. Aspartate transcarbamoylase 
exists in two conformations: a compact, 
relatively inactive form called the tense (T) 
State and an expanded form called the 
relaxed (R) State. PALA binding stabilizes 
the R State. 




T State 


FIGURE 10.9 CTP stabilizes the T State. 

The binding of CTP to the regulatory 
subunit of aspartate transcarbamoylase 
stabilizes the T State. 


Further examination of the ATCase-PALA complex reveals a remarkable 
change in quaternary structure on binding of PALA. The two catalytic trimers 
move 12 A farther apart and rotate approximately 10 degrees about their com- 
mon threefold axis of symmetry. Moreover, the regulatory dimers rotate ap¬ 
proximately 15 degrees to accommodate this motion (Figure 10.8). The en- 
zyme literally expands on PALA binding. In essence, ATCase has two 
distinct quaternary forms: one that predominates in the absence of substrate 
or substrate analogs and another that predominates when substrates or analogs 
are bound. These forms will be referred to as the T (for tense) State and the 
R (for relaxed) State, respectively. The T State has lower affinity for substrates 
and, hence, lower catalytic activity than does the R State. In the presence of 
any fixed concentration of aspartate and carbamoyl phosphate, the enzyme 
exists in equilibrium between the T and the R forms. The position of the equi- 
librium depends on the number of active sites that are occupied by substrate. 

Having located the active sites and seen that PALA binding results in 
substantial structural changes in the entire ATCase molecule, we now turn 
our attention to the effects of CTP. Where on the regulatory subunit does 
CTP bind? Determination of the structure of ATCase in the presence of 
CTP reveals a binding site for this nucleotide in each regulatory chain in a 
domain that does not interact with the catalytic subunit (Figure 10.9). The 
question naturally arises as to how CTP can inhibit the catalytic activity of 
the enzyme when it does not interact with the catalytic chain. Each active 
site is more than 50 A from the nearest CTP binding site. The CTP-bound 
form is in the T quaternary state in the absence of bound substrate. 

The quaternary structural changes observed on substrate-analog bind¬ 
ing suggest a mechanism for the allosteric regulation of ATCase by CTP 
(Figure 10.10). The binding of the inhibitor CTP shifts the equilibrium 
toward the T state, decreasing the net enzyme activity and reducing the rate 
of iV-carbamoylaspartate generation. This mechanism for allosteric regula¬ 
tion is referred to as the concerted mechanism because the change in the en- 


FIGURE 10.10 The R state and the 
T State are in equilibrium. Even in the 
absence of any substrate or regulators, 
aspartate transcarbamoylase exists in an 
equilibrium between the R and the T States. 
Under these conditions, the T state is 
favored by a factor of approximately 200. 


T State 

(less active) 



Favored by CTP binding 


R State 

(more active) 



Favored by substrate binding 







zyme is ‘‘all or none''; the entire enzyme is converted from T into R, af- 
fecting all of the catalytic sites equally. The concerted mechanism stands in 
contrast with the sequential mechanism, which will be discussed shortly. 

10.1.3 Allosterically Regulated Enzymes Do Not Follow 
Michaelis-Menten Kinetics 

Allosteric enzymes are distinguished by their response to substrate con- 
centration in addition to their susceptibility to regulation by other mole- 
cules. Examining the rate of product formation as a function of substrate 
concentration can be a source of further insights into the mechanism of reg¬ 
ulation of AT Case (Figure 10.11). The curve differs from that expected for 
an enzyme that folio ws Michaelis-Menten kinetics. The observed curve is 
referred to as sigmoid because it resembles an ‘‘S.’' How can we explain this 
kinetic behavior in light of the structural observations? In the absence of 
substrate, the enzyme exists almost entirely in the T State. However, the 
binding of substrate molecules to the enzyme shifts the enzyme toward the 
R State. A transition from T to R favored by substrate binding to one site 
will increase the enzymatic activity of the remaining five sites, leading to an 
overall increase in enzyme activity. This important property is called coop- 
erativity because the subunits cooperate with one another. If one subunit 
switches conformation, they all do. The sigmoid curve can be pictured as a 
composite of two Michaelis-Menten curves, one corresponding to the T 
State and the other to the R State. An increase in substrate concentration fa- 
vors a transition from the T-state curve to the R-state curve (Figure 10.12). 

The importance of the changes in quaternary structure in determining 
the sigmoidal curve is illustrated nicely by studies of the isolated catalytic 
trimer, freed by p-hydroxymercuribenzoate treatment. The catalytic sub¬ 
unit shows Michaelis-Menton kinetics with kinetic parameters that are in- 
distinguishable from those deduced for the R State. Thus, the term tense is 
apt: in the T state, the regulatory dimers hold the two catalytic trimers suf- 
ficiently dose to one other that key loops on their surfaces collide and in¬ 
terfere with conformational adjustments necessary for high-affinity sub¬ 
strate binding and catalysis. 

10.1.4 Allosteric Regulators Modulate the T-to-R Equilibrium 

What is the effect of CTP on the kinetic profile of ATCase? CTP increases 
the initial phase of the sigmoidal curve (Figure 10.13). As noted earlier, CTP 
inhibits the activity of ATCase. In the presence of CTP, the enzyme be- 
comes less responsive to the cooperative effects facilitated by substrate bind¬ 
ing; more substrate is required to attain a given reaction rate. Interestingly, 
ATP, too, is an allosteric effector of ATCase. However, the effect of ATP 
is to increase the reaction rate at a given aspartate concentration (Figure 
10.14). At high concentrations of ATP, the kinetic profile shows a less- 
pronounced sigmoidal behavior. Note that such sigmoidal behavior has an 



FIGURE 10.13 Effect of CTP on ATCase 
kinetics. Cytidine triphosphate (CTP) 
stabilizes the T State of aspartate 
transcarbamoylase, making it more difficult 
for substrate binding to convert the enzyme 
into the R State. As a result, the curve is 
shifted to the right, as shown in red. 
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FIGURE 10.11 ATCase displays 
sigmoidal kinetics. A plot of product 
formation as a function of substrate 
concentration produces a sigmoidal curve 
because the binding of substrate to one 
active site favors the conversion of the 
entire enzyme into the R State, increasing 
the activity at the other active sites. Thus, 
the active sites show cooperativity. 



[Aspartate] —> 


FIGURE 10.12 Basis for the sigmoidal 
curve. The generation of the sigmoidal 
curve by the property of cooperativity can 
be understood by imagining an allosteric 
enzyme as a mixture of two Michaelis- 
Menten enzymes, one with a high value 
of /Cm that corresponds to the T State 
and another with a low value of 
that corresponds to the R state. As the 
concentration of substrate is increased, 
the equilibrium shifts from the T state to 
the R State, which resuits in a steep rise 
in activity with respect to substrate 
concentration. 
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FIGURE 10.14 Effect of ATP on ATCase 
kínetícs. ATP is an allosteric activator of 
aspartate transcarbamoylase because it 
stabilizes the R State, making it easier for 
substrate to bind. As a result, the curve is 
shifted to the left, as shown in blue. 


additional consequence: in the concentration range where the T-to-R tran- 
sition is taking place, the curve depends quite steeply on the substrate con¬ 
centration. The effects of substrates on allosteric enzymes are referred to as 
homotropic effects (from the Greek homós, ‘‘same''). In contrast, the effects 
of nonsubstrate molecules on allosteric enzymes (such as those of CTP and 
ATP on ATCase) are referred to as heterotropic effects (from the Greek 
héteros, ‘‘different’'). 

The increase in ATCase activity in response to increased ATP concen¬ 
tration has two potential physiological explanations. First, high ATP con¬ 
centration signals a high concentration of purine nucleotides in the cell; the 
increase in ATCase activity will tend to balance the purine and pyrimidine 
pools. Second, a high concentration of ATP indicates that there is signifi- 
cant energy stored in the cell to promote mRNA synthesis and DNA 
replication. 

10.1.5 The Concerted Model Can Be Formulated 
in Quantitative Terms 


CONCEPTUAL INSIGHTS, Cooperatíve Binding and Kínetícs. Interactive 
graphing activities allow you to experiment with changes in the parameters and 
conditions of the MWC model in order to increase your understanding of the model and 
its implications for cooperatíve binding and kinetics. 


The concerted model was first proposed by Jacques Monod, Jeffries 
Wyman, and Jean-Pierre Changeux; hence, it is often referred to as the 
MWC model. This model can be formulated in quantitative terms. Con- 
sider an enzyme with n identical active sites. Suppose that the enzyme ex- 
ists in equilibrium between a T form with a low affinity for its substrate and 
an R form with a high affinity for the substrate. We can define L as the equi¬ 
librium constant between the R and the T forms; c as the ratio of the affini- 
ties of the two forms for the substrate, S, measured as dissociation constants; 
and a as the ratio of substrate concentration to the dissociation constant Kr. 


R^^T 

SitCR + S V SitCR — S 
Site^ + S ^ " Site-r — S 
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The fraction of active sites bound to substrate (fractional saturation, Ys) 
is given by 

^ _ a(l - a)^-^ + Lca(l + 

^ “ (1 + a)" + L(1 + cay 


where n is the number of sites in the enzyme. 

This quantitative model can be used to examine the data from ATCase, 
for which n = 6. Excellent agreement with experimental data is obtained 

















with L ~ 200 and c ~ 0.1. Thus, in the absence of bound substrate, the equi- 
librium favors the T form by a factor of 200 (i.e., only 1 in 200 molecules 
is in the R form), and the affinity of the R form for substrate is approxi- 
mately 10 times as high as that of the T form. As substrate binds to each 
active site, the equilibrium shifts toward the R form. For example, with 
these parameters, when half the active sites (three of six) are occupied by 
substrate, the equilibrium has shifted so that the ratio of T to R is now 1 to 
5; that is, nearly all the molecules are in the R form. 

The effects of CTP and ATP can be modeled simply by changing the 
value of L. For the CTP-saturated form, the value of L increases to 1250. 
Thus, it takes more substrate to shift the equilibrium appreciably to the 
R form. For the ATP saturated form, the value of L decreases to 70 
(Figure 10.15). 

10.1.6 Sequential Models Aiso Can Account for Allosteric Effects 

In the concerted model, an allosteric enzyme can exist in one of only two 
States, T and R; no intermediate States are allowed. An alternative, first pro- 
posed by Daniel Koshland, posits that sequential changes in structure take 
place within an oligomeric enzyme as active sites are occupied. The bind- 
ing of substrate to one site influences the substrate affinity of neighboring 
active sites without necessarily inducing a transition encompassing the entire 
enzyme (Figure 10.16). An important feature of sequential in contrast with 
concerted models is that the former can account for negative cooperativity, 
in which the binding of substrate to one active site decreases the affinity of 
other sites for substrate. The results of studies of a number of allosteric pro- 
teins suggest that most behave according to some combination of the se¬ 
quential and cooperative models. 


H269| - 

Hemoglobin 



FIGURE 10.15 Quantitative descriptíon 

of the MWC model. Fractíonal activity, 

K is the fraction of active sites bound to 
substrate and is directiy proportional to 
reaction velocity; a is the ratio of [S] to 
the dissociation constant of S with the 
enzyme in the R State; L is the ratio of the 
concentration of enzyme in the T State to 
that in the R State. The binding of the 
regulators ATP and CTP to ATCase changes 
the value of L and thus the response to 
substrate concentration. 



FIGURE 10.16 Símple sequential model for a tetrameric allosteric enzyme. The 

binding of a ligand (L) to a subunit changes the conformation of that particular subunit 
from the T (square) to the R (circie) form. This transition affects the affinity of the other 
subunits for the ligand. 


10.2 HEMOGLOBIN TRANSPORTS OXYGEN EFFICIENTLY 
BY BINDING OXYGEN COOPERATIVELY 

Allostery is a property not limited to enzymes. The basic principies of al- 
lostery are also well illustrated by the oxygen-transport protein hemoglobin 
(Section 7.2). The binding of oxygen to hemoglobin isolated from red blood 
cells displays marked sigmoidal behavior (similar to that observed for the 
activity of ATCase, as a function of substrate concentration), which is in- 
dicative of cooperation between subunits (Figure 10.17). What is the phys- 
iological significance of the cooperative binding of oxygen by hemoglobin? 
Oxygen must be transported in the blood from the lungs, where the partial 
pressure of oxygen (PO 2 ) is relatively high (approximately 100 torr), to the 
tissues, where the partial pressure of oxygen is much lower (typically 20 torr). 
Let us consider how the cooperative behavior represented by the sigmoidal 


Tissues Lungs 



FIGURE 10.17 Cooperativity enhances 
oxygen delivery by hemoglobin. 

Because of cooperativity between O 2 - 
binding sites, hemoglobin delivers more 
O 2 to tissues than would a noncooperative 
protein (p 02 , partial pressure of oxygen.) 
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Torr— 

A unit of pressure equal to that exerted 
by a column of mercury 1 mm high at 
0°C and standard gravity (1 mm Hg). 
Named after Evangelista Torricelli 
(1608-1647), inventor of the mercury 
barometer. 



Heme 

(Fe-protoporphyrin IX) 



FIGURE 10.18 Posítíon of íron ín 
deoxyhemoglobín. The iron ion lies 
slightly outside the plane of the porphyrin 
in heme. 


curve leads to efficient oxygen transport. In the lungs, hemoglobin becomes 
nearly saturated with oxygen such that 98% of the oxygen-binding sites are 
occupied. When hemoglobin moves to the tissues, the saturation levei drops 
to 32%. Thus, a total of 98 — 32 = 66% of the potential oxygen-binding sites 
contribute to oxygen transport. In comparison, for a hypothetical noncoop- 
erative transport protein, the most oxygen that can be transported from a 
region in whichp02 is 100 torr to one in which it is 20 torr is 63 — 25 = 38% 
(see Figure 10.17). Thus, the cooperative binding of oxygen by hemoglobin en- 
ables it to deliver 1.7 times as much oxygen as it would if the sites were inde- 
pendent. The homotropic regulation of hemoglobin by its ligand oxygen dra- 
matically increases its physiological oxygen-carrying capacity. 

10.2.1 Oxygen Binding Induces Substantial Structural Changes 
at the Iron Sites in Hemoglobin 

The results of structural studies pioneered by Max Perutz revealed the 
structure of hemoglobin in various forms. Human hemoglobin A, present 
in adults, consists of four subunits: two a subunits and two |3 subunits. The 
a and |3 subunits are homologous and have similar three-dimensional struc- 
tures (Section 7.4). The capacity of hemoglobin to bind oxygen depends on 
the presence of a bound prosthetic group called heme. The heme group is 
responsible for the distinctive red color of blood. The heme group consists 
of an organic component and a central iron atom. The organic component, 
called protoporphyrin, is made up of four pyrrole rings linked by methene 
bridges to form a tetrapyrrole ring. Four methyl groups, two vinyl groups, 
and two propionate side chains are attached. 

The iron atom lies in the center of the protoporphyrin, bonded to the 
four pyrrole nitrogen atoms. Under normal conditions, the iron is in the fer- 
rous (Fe^^) oxidation state. The iron ion can form two additional bonds, 
one on each side of the heme plane. These binding sites are called the fifth 
and sixth coordination sites. In hemoglobin, the fifth coordination site is 
occupied by the imidazole ring of a histidine residue from the protein. In 
deoxyhemoglobin, the sixth coordination site remains unoccupied. The iron 
ion lies approximately 0.4 À outside the porphyrin plane because iron, in 
this form, is slightly too large to fit into the well-defined hole within the 
porphyrin ring (Figure 10.18). 

The binding of the oxygen molecule at the sixth coordination site of the 
iron ion substantially rearranges the electrons within the iron so that the ion 
becomes effectively smaller, allowing it to move into the plane of the por¬ 
phyrin (Figure 10.19). This change in electronic structure is paralleled by 



O2 



FIGURE 10.19 Oxygen binding initiates structural changes. The íron ion moves into the 
plane of the heme on oxygenation. The proximal histidine is pulled along with the iron ion. 
















changes in the magnetic properties of hemoglobin, which are the basis for 
functional magnetic resonance imaging (fMRI; Section 32.1.3). Indeed, the 
structural changes that take place on oxygen binding were anticipated by 
Linus Pauling, based on magnetic measurements in 1936, nearly 25 years 
before the three-dimensional structure of hemoglobin was elucidated. 

10.2.2 Oxygen Binding Markedly Changes the Quaternary 
Structure of Hemoglobin 

The three-dimensional structure of hemoglobin is best described as a pair 
of identical aP dimers and oi 2 ^ 2 ) that associate to form the hemoglo¬ 
bin tetramer (Figure 10.20). In deoxyhemoglobin, these aP dimers are 
linked by an extensive interface, which includes, among other regions, the 
carboxyl terminus of each chain. The heme groups are well separated in the 
tetramer with iron-iron distances ranging from 24 to 40 A. The deoxy form 
corresponds to the T State in the context of either the concerted or the se- 
quential model for hemoglobin cooperativity. On oxygen binding, there are 
substantial changes in quaternary structure that correspond to the T-to-R 
State transition (Figure 10.21). 



FIGURE 10.20 Quaternary 
structure of hemoglobin. Hemoglobin, 
which is composed of two a chains and 
two p chains, functions as a pair of ap 
dimers. 



T State 


R State 




FIGURE 10.21 Transition from T to R State in hemoglobin. On oxygenation. 


one pair of ap subunits shifts with respect to the other by a rotation of 15 degrees. 


The aiPi and a 2^2 dimers rotate approximately 15 degrees with respect to 
one another. The dimers themselves are relatively unchanged, although lo- 
calized conformational shifts do occur. Thus, the interface between the a^Pi 
and oí 2^2 dimers is most effected by this structural transition. 

How does oxygen binding lead to the structural transition from the T 
State to the R state? When the iron ion moves into the plane of the por- 
phyrin, the histidine residue bound in the fifth coordination site moves with 
it. This histidine residue is part of an a helix, which also moves (Figure 
10.22). The carboxyl terminal end of this a helix lies in the interface be¬ 
tween the two aP dimers. Consequently, the structural transition at the iron 
ion is directly transmitted to the other subunits. The rearrangement of the 
dimer interface pr o vides a pathway for communication between subunits, 
enabling the cooper ative binding of oxygen. 

Is the cooperative binding of oxygen by hemoglobin best described 
by the concerted or the sequential model? Neither model in its pure form 
fully accounts for the behavior of hemoglobin. Instead, a combined model 



Oxyhemoglobin 

FIGURE 10.22 Conformational 
changes in hemoglobin. The movement 
of the iron ion on oxygenation brings the 
iron-associated histidine residue toward 
the porphyrin ring. The corresponding 
movement of the histidine-containing a 
helix alters the interface between the ap 
pairs, instigating other structural changes. 
For comparison, the deoxyhemoglobin 
structure is shown in gray behind the 
oxyhemoglobin structure in color. 





-1 272 I- 

CHAPTER 10 • Regulatory Strategies: 

Enzymes and Hemoglobin 



2,3-Bisphosphoglycerate 

(2,3-BPG) 


FIGURE 10.23 Mode of bindíng 
of 2,3-BPG to human 
deoxyhemoglobín. 2,3-BPG binds in the 
central cavity of deoxyhemoglobin. There, 
it interacts with three positively charged 
groups on each p chain. 


is required. Hemoglobin behavior is concerted in that hemoglobin with 
three sites occupied by oxygen is in the quaternary structure associated with 
the R State. The remaining open binding site has an affinity for oxygen more 
than 20 -fold as great as that of fully deoxygenated hemoglobin binding its 
first oxygen. However, the behavior is not fully concerted, because hemo¬ 
globin with oxygen bound to only one of four sites remains in the T-State 
quaternary structure. Yet, this molecule binds oxygen 3 times as strongly as 
does fully deoxygenated hemoglobin, an observation consistent only with a 
sequential model. These results highlight the fact that the concerted and se- 
quential models represent idealized limiting cases, which real systems may 
approach but rarely attain. 

10.2.3 Tuning the Oxygen Affinity of Hemoglobin: The Effect 
of 2,3-Bisphosphoglycerate 

Examination of the oxygen binding of hemoglobin fully purified from red 
blood cells revealed that the oxygen affinity of purified hemoglobin is much 
greater than that for hemoglobin within red blood cells. This dramatic dif- 
ference is due to the presence within these cells of 2,3-bisphosphoglycerate 
(2,3-BPG) (also known as 2,3-diphosphoglycerate or 2,3-DPG). This 
highly anionic compound is present in red blood cells at approximately the 
same concentration as that of hemoglobin (~2 mM). Without 2,3-BPG, 
hemoglobin would be an extremely inefficient oxygen transporter, releas- 
ing only 8 % of its cargo in the tissues. 

How does 2,3-BPG affect oxygen affinity so significantly? Examination 
of the crystal structure of deoxyhemoglobin in the presence of 2,3-BPG re- 
veals that a single molecule of 2,3-BPG binds in a pocket, present only in 
the T form, in the center of the hemoglobin tetramer (Figure 10.23). On 
T-to-R transition, this pocket collapses. Thus, 2,3-BPG binds preferentially 
to deoxyhemoglobin and stabilizes it, effectively reducing the oxygen affin¬ 
ity. In order for the structural transition from T to R to take place, the bonds 
between hemoglobin and 2,3-BPG must be broken and 2,3-BPG must be 
expelled. 



Y 2,3-BPG binding to hemoglobin has other crucial physiological con- 
sequences. The globin gene expressed by fetuses differs from that 
expressed by human adults; fetal hemoglobin tetramers include two a 
chains and two 7 chains. The 7 chain, a result of another gene duplication, 
is 72% identical in amino acid sequence with the (3 chain. One noteworthy 
change is the substitution of a serine residue for His 143 in the |3 chain of 
the 2,3-BPG-binding site. This change removes two positive charges 
from the 2,3-BPG-binding site (one from each chain) and reduces the 
affinity of 2,3-BPG for fetal hemoglobin, thereby increasing the oxygen- 



binding affinity of fetal hemoglobin relative to that of maternal (adult) he- 
moglobin (Figure 10.24). This difference in oxygen affinity allows oxygen 
to be effectively transferred from maternal to fetal red cells. We see again 
an example of where gene duplication and specialization produced a ready 
solution to a biological challenge—in this case, the transport of oxygen 
from mother to fetus. 

10.2.4 The Bohr Effect: Hydrogen lons and Carbon Dioxide 
Promote the Release of Oxygen 

Rapidly metabolizing tissues, such as contracting muscle, have a high need 
for oxygen and generate large amounts of hydrogen ions and carbon diox¬ 
ide as well (Sections 16.1.9 and 17.1). Both of these species are heterotropic 
effectors of hemoglobin that enhance oxygen release. The oxygen affinity 
of hemoglobin decreases as pH decreases from the value of 7.4 found in the 
lungs (Figure 10.25). Thus, as hemoglobin moves into a region of low pH, 
its tendency to release oxygen increases. For example, transport from the 
lungs, with pH 7.4 and an oxygen partial pressure of 100 torr, to active mus¬ 
cle, with a pH of 7.2 and an oxygen partial pressure of 20 torr, results in a 
release of oxygen amounting to 77% of total carrying capacity. Recall that 
only 66% of the oxygen would be released in the absence of any change in 
pH. In addition, hemoglobin responds to carbon dioxide with a decrease in 
oxygen affinity, thus facilitating the release of oxygen in tissues with a high 
carbon dioxide concentration. In the presence of carbon dioxide at a partial 
pressure of 40 torr, the amount of oxygen released approaches 90% of the 
maximum carrying capacity. Thus, the heterotropic regulation of hemoglobin 
by hydrogen ions and carbon dioxide further increases the oxygen-transporting 
efficiency of this magnificent allosteric protein. 

The regulation of oxygen binding by hydrogen ions and carbon dioxide 
is called the Bohr effect after Christian Bohr, who described this phenome- 
non in 1904. The results of structural and chemical studies have revealed 
much about the chemical basis of the Bohr effect. At least two sets of Chem¬ 
ical groups are responsible for the effect of protons: the amino termini and 
the side chains of histidines |3146 and a 122, which have pK^ values near 
pH 7. Consider histidine |3146. In deoxyhemoglobin, the terminal carboxyl- 
ate group of |3146 forms a salt bridge with a lysine residue in the a subunit 
of the other a (3 dimer. This interaction locks the side chain of histidine |3146 
in a position where it can participate in a salt bridge with negatively charged 
aspartate 94 in the same chain, provided that the imidazole group of the his¬ 
tidine residue is protonated (Figure 10.26). At high pH, the side chain of 
histidine |3146 is not protonated and the salt bridge does not form. As the 
pH drops, however, the side chain of histidine |3146 becomes protonated, the 



FIGURE 10.24 Oxygen affinity of fetal 
red blood cells. Fetal red blood cells 
have a higher oxygen affinity than that of 
maternal red blood cells because fetal 
hemoglobin does not bind 2,3-BPG as 
well as maternal hemoglobin does. 


— pH 7.4, no CO 2 


pH 7.2, no CO 2 
— pH 7.2, 40 torr CO 2 
Tissues Lungs 



FIGURE 10.25 Effect of pH and CO 2 
concentration on the oxygen affinity 
of hemoglobin. Lowering the pH from 
7.4 (red cu n/e) to 7.2 (blue cu n/e) results 
in the release of O 2 from oxyhemoglobin. 
Raising the CO 2 partial pressure from 
0 to 40 torr (purple cun/e) aiso favors 
the release of O 2 from oxyhemoglobin. 



FIGURE 10.26 Chemical basis of the 
Bohr effect. In deoxyhemoglobin, shown 
here, three amino acid residues form two 
salt bridges that stabilize the T quaternary 
structure. The formation of one of the 
salt bridges depends on the presence 
of an added proton on histidine pi46. 

The proximity of the negative charge on 
aspartate 94 favors protonation of histidine 
pi46 in deoxyhemoglobin. 
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salt bridge with aspartate (394 forms, and the quaternary structure charac- 
teristic of deoxyhemoglobin is stabilized, leading to a greater tendency for 
oxygen to be released at actively metabolizing tissues. No significant change 
takes place in oxy hemoglobin o ver the same pH range. 

Carbon dioxide also stabilizes deoxyhemoglobin by reacting with the 
terminal amino groups to form carbamate groups, which are negatively 
charged, in contrast with the neutral or positive charges on the free amino 
groups. The amino termini lie at the interface between the a(3 dimers, and 
these negatively charged carbamate groups participate in salt-bridge inter- 
actions, characteristic of the T-state structure, which stabilize deoxyhemo¬ 
globin s structure and favor the release of oxygen. 

Hemoglobin with bound carbon dioxide and hydrogen ions is carried in 
the blood back to the lungs, where it releases the hydrogen ions and carbon 
dioxide and rebinds oxygen. Thus, hemoglobin helps to transport hydro¬ 
gen ions and carbon dioxide in addition to transporting oxygen. However, 
transport by hemoglobin accounts for only about 14% of the total transport 
of these species; both hydrogen ions and carbon dioxide are also transported 
in the blood as bicarbonate (HCOa”) formed spontaneously or through 
the action of carbonic anhydrase (Section 9.2), an abundant enzyme in red 
blood cells. 


10.3 ISOZYMES PROVIDE A MEANS OF REGULATION 
SPECIFIC TO DISTINCT TISSUES AND DEVELOPIVIENTAL 
STAGES 

Isozymes or isoenzymes, are enzymes that differ in amino acid sequence yet 
catalyze the same reaction. Usually, these enzymes display different kinetic 
parameters, such as Km, or differ ent regulatory properties. They are en- 
coded by different genetic loci, which usually arise through gene duplica- 
tion and divergence (Section 2.2.5). Isozymes differ from allozymes, which 
are enzymes that arise from allelic variation at one gene locus. Isozymes can 
often be distinguished from one another by biochemical properties such as 
electrophoretic mobility. 

The existence of isozymes permits the fine-tuning of metabolism to meet the 
particular needs of a given tissue or developmental stage. Consider the ex- 
ample of lactate dehydrogenase (LDH), an enzyme that functions in anaer- 
obic glucose metabolism and glucose synthesis. Human beings have two 
isozymic polypeptide chains for this enzyme: the H isozyme highly ex- 
pressed in heart and the M isozyme found in skeletal muscle. The amino 
acid sequences are 75% identical. The functional enzyme is tetrameric, and 
many different combinations of the two subunits are possible. The H 4 
isozyme, found in the heart, has a higher affinity for substrates than does 
the M 4 isozyme. The two isozymes also differ in that high leveis of pyru- 
vate allosterically inhibit the H 4 but not the M 4 isozyme. The other com¬ 
binations, such as H 3 M, have intermediate properties depending on the ra- 
tio of the two kinds of chains. We will consider these isozymes in their 
biological context in Chapter 16. 

T The M 4 isozyme functions optimally in an anaerobic environment, 
whereas the H 4 isozyme does so in an aerobic environment. Indeed, 
the proportions of these isozymes are altered in the course of development 
of the rat heart as the tissue switches from an anaerobic environment to an 
aerobic one (Figure 10.27A). Figure 10.27B shows the tissue-specific forms 
of lactate dehydrogenase in adult rat tissues. The appearance of some 





(A) (B) Red 








Heart 

Kidney 

blood cell 

Brain 

Leukocyte 

Muscle 

Liver 

LDH-l 




■ ■ 

■ ■ 

M 4 




— 

— 

— 

— 

LDH-2 





HIVI3 

— 

— 



— 


— 

LDH-3 





H2M2 

— 







LDH-4 


• • 

• • 


H3M 



















LDH-5 

• • 

• • 




H4 









-9 

-5 

-1 +12 

+21 Adult 










FIGURE 10.27 Isozymes of lactate dehydrogenase. (A) The rat heart LDH isozyme 
profile changes in the course of development The H isozyme is represented by squares and 
the M isozyme by circies. The negative and positive numbers denote the days before and 
after birth, respectively. (B) LDH isozyme content varies by tissue. [(A) After W.-H. Li, Molecular 
Evolution (Sinauer, 1997), p. 283; (B) After K. Urich, Comporative Animal Biochemistry (Springer Verlag, 
1990), p. 542.] 


isozymes in the blood is indicative of tissue damage and can be used for 
clinicai diagnosis. For instance, an increase in serum leveis of H 4 relative 
to H 3 M is an indication that a myocardial infarction, or heart attack, has 
occurred. 


10.4 COVALENT MODIFICATION IS A MEANS 
OF REGULATING ENZYME ACTIVITY 

The covalent attachment of another molecule can modify the activity of en- 
zymes and many other proteins. In these instances, a donor molecule pro- 
vides a functional moiety that modifies the properties of the enzyme. Most 
modifications are reversible. Phosphorylation and dephosphorylation are 
the most common but not the only means of covalent modification. His- 
tones—proteins that assist in the packaging of DNA into chromosomes as 
well as in gene regulation—are rapidly acetylated and deacetylated in vivo 
(Section 31 .X.X). More heavily acetylated histones are associated with genes 
that are being actively transcribed. The acetyltransferase and deacetylase 
enzymes are themselves regulated by phosphorylation, showing that the co¬ 
valent modification of histones may be controlled by the covalent modifi¬ 
cation of the modifying enzymes. 

Modification is not readily reversible in some cases. Some proteins in 
signal-transduction pathways, such as Ras and Src (a protein tyrosine ki- 
nase), are localized to the cytoplasmic face of the plasma membrane by the 
irreversible attachment of a lipid group (Section 12.5.3). Fixed in this loca- 
tion, the proteins are better able to receive and transmit information that is 
being passed along their signaling pathways (Chapter 15). The attachment 
of ubiquitin, a protein comprising 72 amino acids, is a signal that a protein 
is to be destroyed, the ultimate means of regulation (Chapter 23). Cyclin, 
an important protein in cell-cycle regulation, must be ubiquitinated and de¬ 
stroyed before a cell can enter anaphase and proceed through the cell cycle 
(TablelO.l). 

Virtually all the metabolic processes that we will examine are regulated 
in part by covalent modification. Indeed, the allosteric properties of many 
enzymes are modified by covalent modification. Table 10.1 lists some of the 
common covalent modifications. 
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TABLE 10.1 Common covalent modifications of protein activity 


Modification 

Donor molecule 

Example 

of modified protein 

Protein function 

Phosphorylation 

ATP 

Glycogen phosphorylase 

Glucose 

homeostasis; energy 
transduction 

Acetylation 

Acetyl GoA 

Histones 

DNA packing; transcription 

Myristoylation 

Myristoyl CoA 

Src 

Signal transduction 

ADP-ribosylation 

NAD 

RNA polymerase 

Transcription 

Farnesylation 

Farnesyl pyrophosphate 

Ras 

Signal transduction 

7 - Carboxylation 

HCO3- 

Thrombin 

Blood clotting 

Sulfation 

3' - Phosphoadenosine- 5' - 
phosphosulfate 

Fibrinogen 

Blood-clot formation 

Ubiquitination 

Ubiquitin 

Cyclin 

Gontrol of cell cycle 


10.4.1 Phosphorylation Is a Highly Effective Means 
of Regulating the Activities of Target Proteins 

The activities of many enzymes, membrane channels, and other target pro¬ 
teins are regulated by phosphorylation, the most prevalent reversible cova¬ 
lent modification. Indeed, we will see this regulatory mechanism in virtu- 
ally every metabolic process in eukaryotic cells. The enzymes catalyzing 
phosphorylation reactions are called protein kinases, which constitute one of 
the largest protein families known, with more than 100 homologous en¬ 
zymes in yeast and more than 550 in human beings. This multiplicity of 
enzymes allows regulation to be fine-tuned according to a specific tissue, 
time, or substrate. 

The terminal ( 7 ) phosphoryl group of ATP is transferred to specific 
serine and threonine residues by one class of protein kinases and to specific 
tyrosine residues by another. 
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Source: After D. Fell, Understanding the Control of Metabolism (Portland Press, 1997), 
Table 7.2. 
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TABLE 10.2 Examples of serine and threonine kinases and their 
activating signals 


Signal 

Enzyme 

Cyclic nucleotides 

Cyclic AMP-dependent protein kinase 

Cyclic GMP-dependent protein kinase 

Ca^"^ and calmodulin 

Ca^"^-calmodulin protein kinase 

Phosphorylase kinase or glycogen synthase kinase 2 

AMP 

AMP-activated kinase 

Diacylglycerol 

Protein kinase C 

Metabolic 

Many target specific enzymes, such as pyruvate 

intermediates 

dehydrogenase kinase and branched-chain 

and other “locaf ’ 
effectors 

ketoacid dehydrogenase kinase 


The acceptors in protein phosphorylation reactions are located inside cells, 
where the phosphoryl-group donor ATP is abundant. Proteins that are 
entirely extracellular are not regulated by reversible phosphorylation. Table 
10.2 lists a few of the known protein kinases. Protein phosphatases reverse 
the effects of kinases by catalyzing the hydrolytic removal of phosphoryl 
groups attached to proteins. 


P 

Protein 

phosphatase 

+ H 2 O -^^ 


0 


0 

Phosphorylated 

protein 


Orthophosphate 

(Pi) 


The unmodified hydroxyl-containing side chain is regenerated and or- 
thophosphate (Pi) is produced. 

It is important to note that phosphorylation and dephosphorylation are 
not the reverse of one another; each is essentially irreversible under physi- 
ological conditions. Furthermore, both reactions take place at negligible 
rates in the absence of enzymes. Thus, phosphorylation of a protein sub- 
strate will take place only through the action of a specific protein kinase and 
at the expense of ATP cleavage, and dephosphorylation will result only 
through the action of a phosphatase. The rate of cycling between the phos- 
phorylated and the dephosphorylated States depends on the relative activi- 
ties of kinases and phosphatases. Note that the net outcome of the two re¬ 
actions is the hydrolysis of ATP to ADP and Pi, which has a AG of 
— 12 kcal moP^ ( — 50 kj moP^) under cellular conditions (Section 14.1.2). 
This highly favorable free-energy change ensures that target proteins cycle 
unidirectionally between unphosphorylated and phosphorylated forms. 

Phosphorylation is a highly effective means of controlling the activity of 
proteins for structural, thermodynamic, kinetic, and regulatory reasons: 

1 . A phosphoryl group adds two negative charges to a modified protein. 
Electrostatic interactions in the unmodified protein can be disrupted and 
new electrostatic interactions can be formed. Such structural changes can 
markedly alter substrate binding and catalytic activity. 


Protein-OH + ATP 


Protein-OPOs^ + ADP 


yH,o 


Protein-OH + HOPO,^- 
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2. A phosphate group can form three or more hydrogen bonds. The tetra- 
hedral geometry of the phosphoryl group makes these hydrogen bonds 
highly directional, allowing for specific interactions with hydrogen-bond 
donors. 

3. The free energy of phosphorylation is large. Of the—12 kcal moP^ 
(— 50 kj moP^) provided by ATP, about half is consumed in making phos¬ 
phorylation irreversible; the other half is conserved in the phosphorylated 
protein. Recall that a free-energy change of 1.36 kcal moP^ (5.69 kJ moP^) 
corresponds to a factor of 10 in an equilibrium constant (Section 14.1.3). 
Hence, phosphorylation can change the conformational equilibrium be- 
tween different functional States by a large factor, of the order of lO"^. 

4. Phosphorylation and dephosphorylation can take place in less than a sec- 
ond or over a span of hours. The kinetics can be adjusted to meet the tim- 
ing needs of a physiological process. 

5. Phosphorylation often evokes highly amplified effects. A single activated 
kinase can phosphorylate hundreds of target proteins in a short interval. 
Further amplification can take place because the target proteins may be 
enzymes, each of which can then transform a large number of substrate 
molecules. 

6. ATP is the cellular energy currency (Chapter 14). The use of this com- 
pound as a phosphoryl group donor links the energy status of the cell to the 
regulation of metabolism. 

Protein kinases vary in their degree of specificity. Dedicated protein ki- 
nases phosphorylate a single protein or several closely related ones. Multi- 
functional protein kinases modify many different targets; they have a wide 
reach and can coordinate diverse processes. Comparisons of amino acid se- 
quences of many phosphorylation sites show that a multifunctional kinase 
recognizes related sequences. For example, the consensus sequence recog- 
nized by protein kinase A is Arg-Arg-X-Ser-Z or Arg-Arg-X-Thr-Z, in 
which X is a small residue, Z is a large hydrophobic one, and Ser or Thr is 
the site of phosphorylation. It should be noted that this sequence is not 
absolutely required. Lysine, for example, can substitute for one of the 
arginine residues but with some loss of affinity. Short synthetic peptides 
containing a consensus motif are nearly always phosphorylated by serine- 
threonine protein kinases. Thus, the primary determinant of specificity is the 
amino acid sequence surrounding the serine or threonine phosphorylation site. 
However, distant residues can contribute to specificity. For instance, 
changes in protein conformation may alter the accessibility of a possible 
phosphorylation site. 

10.4.2 Cyclic AMP Activates Protein Kinase A by Altering the 
Quaternary Structure 

Protein kinases modulate the activity of many proteins, but what leads to 
the activation of a kinase? Activation is often a multistep process initiated 
by hormones (Chapter 15). In some cases, the hormones trigger the forma- 
tion of cyclic AMP, a molecule formed by cyclization of ATP. Cyclic AMP 
serves as an intracellular messenger in mediating the physiological actions 
of hormones, as will be discussed in Chapter 15. The striking finding is that 
most effects of cAMP in eukaryotic cells are achieved through the activation by 
cAMP of a single protein kinase. This key enzyme is called protein kinase A or 
PKA. The kinase alters the activities of target proteins by phosphorylating 
specific serine or threonine residues. As we shall see, PKA provides a clear 
example of the integration of allosteric regulation and phosphorylation. 



Pseudosubstrate 

sequence 




FIGURE 10.28 Regulation of protein 
kínase A. The binding of four molecules 
of cAMP activates protein kinase A by 
dissociating the inhibited holoenzyme 
(R2C2) into a regulatory subunit (R2) and 
two catalytically active subunits (C). 


PKA is activated by cAMP concentrations of the order of 10 nM. The 
activation mechanism is reminiscent of that of aspartate transcarbamoylase. 
Like that enzyme, PKA in muscle consists of two kinds of subunits: a 
49-kd regulatory (R) subunit, which has high affinity for cAMP, and a 
38-kd catalytic (C) subunit. In the absence of cAMP, the regulatory and cat- 
alytic subunits form an R 2 C 2 complex that is enzymatically inactive (Figure 
10.28). The binding of two molecules of cAMP to each of the regulatory 
subunits leads to the dissociation of R 2 C 2 into an R 2 subunit and two C sub¬ 
units. These free catalytic subunits are then enzymatically active. Thus, the 
binding of cAMP to the regulatory subunit relieves its inhibition of the catalytic 
subunit. PKA and most other kinases exist in isozymic forms for fine- 
tuning regulation to meet the needs of a specific cell or developmental stage. 

How does the binding of cAMP activate the kinase? Each R chain con- 
tains the sequence Arg-Arg-Gly-A/a-Ile, which matches the consensus se¬ 
quence for phosphorylation except for the presence of alanine in place of 
serine. In the R 2 G 2 complex, this pseudosubstrate sequence of R occupies the 
catalytic site of C, thereby preventing the entry of protein substrates (see 
Figure 10.28). The binding of cAMP to the R chains allosterically moves 
the pseudosubstrate sequences out of the catalytic sites. The released C 
chains are then free to bind and phosphorylate substrate proteins. 

10.4.3 ATP and the Target Protein Bind to a Deep Cleft in the 
Catalytic Subunit of Protein Kinase A 

Y The three-dimensional structure of the catalytic subunit of PKA 
containing a bound 20-residue peptide inhibitor was determined by 
x-ray crystallography. The 350-residue catalytic subunit has two lobes 
(Figure 10.29). ATP and part of the inhibitor fill a deep cleft between 
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FIGURE 10.29 Protein kinase 
A bound to an inhibitor. Three- 
dimensional structure of a complex of the 
catalytic subunit of protein kinase A and an 
inhibitor bearing a pseudosubstrate 
sequence. The inhibitor (yellow) binds in a 
cleft between the domains of the enzyme. 
The bound ATP is adjacent to the active 
site. 























FIGURE 10.30 Bindíng of pseudosubstrate to proteín 
kínase A. The two arginine side chains of the pseudosubstrate 
form sait bridges with three glutamate carboxylates. Hydrophobic 
interactions are aiso important in the recognition of substrate. 

The isoleucine residue of the pseudosubstrate is in contact with a 
pair of leucine residues of the enzyme. 


the lobes. The smaller lobe makes many contacts with 
ATP-Mg^^, whereas the larger lobe binds peptide 
and contributes the key catalytic residues. Like other 
kinases (Section 16.1.1), the two lobes move closer to 
one another on substrate binding; mechanisms for re- 
stricting this domain closure provides a means for reg- 
ulating protein kinase activity. The PKA structure has 
broad significance because residues 40 to 280 constitute a 
conserved catalytic core that is common to essentially all 
known protein kinases. We see here an example of a suc- 
cessful biochemical solution to the problem of protein 
phosphorylation being employed many times in the 
course of evolution. 

The bound peptide in this crystal occupies the active 
site because it contains the pseudosubstrate sequence 
Arg-Arg-Asn-A/a-Ile (Figure 10.30). The structure of 
the complex reveals the basis for the consensus se¬ 
quence. The guanidinium group of the first arginine 
residue forms an ion pair with the carboxylate side chain 
of a glutamate residue (Glu 127) of the enzyme. The sec- 
ond arginine likewise interacts with two other carbox¬ 
ylates. The nonpolar side chain of isoleucine, which 
matches Z in the consensus sequence, fits snugly in a 
hydrophobic groove formed by two leucine residues of 
the enzyme. 


10.5 MANY ENZYMES ARE ACTIVATED BY SPECIFIC 
PROTEOLYTIC CLEAVAGE 

We turn now to a different mechanism of enzyme regulation. Many en- 
zymes acquire full enzymatic activity as they spontaneously fold into their 
characteristic three-dimensional forms. In contrast, other enzymes are syn- 
thesized as inactive precursors that are subsequently activated by cleavage 
of one or a few specific peptide bonds. The inactive precursor is called a 
zymogen (or a proenzyme). A energy source (ATP) is not needed for cleav¬ 
age. Therefore, in contrast with reversible regulation by phosphorylation, 
even proteins located outside cells can be activated by this means. Another 
noteworthy difference is that proteolytic activation, in contrast with al- 
losteric control and reversible covalent modification, occurs just once in the 
life of an enzyme molecule. 

Specific proteolysis is a common means of activating enzymes and other 
proteins in biological systems. For example: 

1. The digestive enzymes that hydrolyze proteins are synthesized as zymo- 
gens in the stomach and pancreas (Table 10.3). 


TABLE 10.3 Gastric and pancreatic zymogens 


Site of synthesis 

Zymogen 

Active enzyme 

Stomach 

Pepsinogen 

Pepsin 

Pancreas 

Chymotrypsinogen 

Chymotrypsin 

Pancreas 

Trypsinogen 

Trypsin 

Pancreas 

Procarboxypeptidase 

Carboxypeptidase 

Pancreas 

Proelastase 

Elastase 








2. Blood clotting is mediated by a cascade of proteolytic activations that en- 
sures a rapid and amplified response to trauma. 

3. Some protein hormones are synthesized as inactive precursors. For ex- 
ample, insulin is derived from proinsulin by proteolytic removal of a peptide. 

4. The fibrous protein collagen, the major constituent of skin and bone, is 
derived from procollagen, a soluble precursor. 

5. Many developmental processes are controlled by the activation of zymo- 
gens. For example, in the metamorphosis of a tadpole into a frog, large 
amounts of collagen are resorbed from the tail in the course of a few days. 
Likewise, much collagen is broken down in a mammalian uterus after de- 
livery. The conversion oíprocollagenase into collagenase, the active protease, 
is precisely timed in these remodeling processes. 

6. Programmed cell death, or apoptosis, is mediated by proteolytic enzymes 
called caspases, which are synthesized in precursor form as procaspases. 
When activated by various signals, caspases function to cause cell death in 
most organisms, ranging from C. elegans (Section 2.4.3) to human beings. 
Apoptosis provides a means sculpting the shapes of body parts in the course 
of development and a means of eliminating cells producing anti-self anti- 
bodies or infected with pathogens as well as cells containing large amounts 
of damaged DNA. 

We next examine the activation and control of zymogens, using as ex- 
amples several digestive enzymes as well as blood-clot formation. 

10.5.1 Chymotrypsinogen Is Activated by Specific Cleavage 
of a Single Peptide Bond 

Chymotrypsin is a digestive enzyme that hydrolyzes proteins in the small 
intestine. Its mechanism of action was discussed in detail in Chapter 9. Its 
inactive precursor, chymotrypsinogen, is synthesized in the pancreas, as are 
several other zymogens and digestive enzymes. Indeed, the pancreas is one 
of the most active organs in synthesizing and secreting proteins. The en¬ 
zymes and zymogens are synthesized in the acinar cells of the pancreas and 
stored inside membrane-bounded granules (Figure 10.31). The zymogen 
granules accumulate at the apex of the acinar cell; when the cell is stimu- 
lated by a hormonal signal or a nerve impulse, the contents of the granules 
are released into a duct leading into the duodenum. 

Chymotrypsinogen, a single polypeptide chain consisting of 245 amino 
acid residues, is virtually devoid of enzymatic activity. It is converted into 
a fully active enzyme when the peptide bond joining arginine 15 and 
isoleucine 16 is cleaved by trypsin (Figure 10.32). The resulting active en¬ 
zyme, called TT-chymotrypsin, then acts on other tt- chymotrypsin mole- 
cules. Two dipeptides are removed to yield a-chymotrypsin, the stable form 
of the enzyme. The three resulting chains in a-chymotrypsin remain linked 
to one another by two interchain disulfide bonds. The striking feature of 
this activation process is that cleavage of a single specific peptide bond trans- 
forms the protein from a catalytically inactive form into one that is fully active. 

10.5.2 Proteolytic Activation of Chymotrypsinogen Leads 
to the Formation of a Substrate-Binding Site 

How does cleavage of a single peptide bond activate the zymogen? Key con- 
formational changes, which were revealed by the elucidation of the three- 
dimensional structure of chymotrypsinogen, result from the cleavage of the 
peptide bond between amino acids 15 and 16. 
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FIGURE 10.31 Secretion of zymogens 
by an acinar cell of the pancreas. 


Chymotrypsinogen 
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1 245 
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FIGURE 10.32 Proteolytic activation of 
chymotrypsinogen. The three chains of 
a-chymotrypsin are linked by two interchain 
disulfide bonds (A to B, and B to C). 
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1. The newly formed amino-terminal group of isoleucine 1 6 turns inward and 
forms an ionic bond with aspartate 194 in the interior of the chymotrypsin 
molecule (Figure 10.33). Protonation of this amino group stabilizes the ac¬ 
tive form of chymotrypsin. 


FIGURE 10.33 Conformations 
of chymotrypsinogen (red) and 
chymotrypsin (blue). The electrostatic 
interaction between the carboxylate of 
aspartate 194 and the a-amino group of 
isoleucine 16, essential for the structure 
of active chymotrypsin, is possible only 
in chymotrypsin. 


Ile 16 



2. This electrostatic interaction triggers a number of conformational 
changes. Methionine 192 moves from a deeply buried position in the zy- 
mogen to the surface of the active enzyme, and residues 187 and 193 be- 
come more extended. These changes result in the formation of the substrate- 
specificity site for aromatic and bulky nonpolar groups. One side of this site 
is made up of residues 189 through 192. This cavity for binding part of the 
substrate is not fully formed in the zymogen. 

3. The tetrahedral transition state in catalysis by chymotrypsin is stabilized 
by hydrogen bonds between the negatively charged carbonyl oxygen atom 
of the substrate and two NH groups of the main chain of the enzyme (Sec- 
tion 9.1.3). One of these NH groups is not appropriately located in chy¬ 
motrypsinogen, and so the oxyanion hole is incomplete in the zymogen. 

4. The conformational changes elsewhere in the molecule are very small. 
Thus, the switching on of enzymatic activity in a protein can be accomplished 
by discrete, highly localized conformational changes that are triggered by the 
hydrolysis of a single peptide bond. 

10.5.3 The Generation of Trypsin from Tryspinogen Leads 
to the Activation of Other Zymogens 

The structural changes accompanying the activation of trypsinogen, the 
precursor of the proteolytic enzyme trypsin, are somewhat different from 
those in the activation of chymotrypsinogen. X-ray analyses have shown 
that the conformation of four stretches of polypeptide, constituting about 
15% of the molecule, changes markedly on activation. These regions, called 
the activation domain, are very flexible in the zymogen, whereas they have a 
well-defined conformation in trypsin. Furthermore, the oxyanion hole (Sec- 
tion 9.1.3) in trypsinogen is too far from histidine 57 to promote the for¬ 
mation of the tetrahedral transition state. 

The digestion of proteins in the duodenum requires the concurrent ac- 
tion of several proteolytic enzymes, because each is specific for a limited 
number of side chains. Thus, the zymogens must be switched on at the same 



Enteropeptidase 
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FIGURE 10.34 Zymogen activation by 
proteolytic cleavage. Enteropeptidase 
initiates the activation of the pancreatic 
zymogens by activating trypsin, which then 
activates other zymogens. Active enzymes 
are shown in yellow; zymogens are shown 
in orange. 


time. Coordinated control is achieved by the action of trypsin as the com- 
mon activator of all the pancreatic zymogens —trypsinogen, chymotrypsino- 
gen, proelastase, procarboxypeptidase, and prolipase, a lipid degrading en- 
zyme. To produce active trypsin, the cells that line the duodenum secrete 
an enzyme, enteropeptidase, that hydrolyzes a unique lysine—isoleucine pep- 
tide bond in trypsinogen as the zymogen enters the duodenum from the 
pancreas. The small amount of trypsin produced in this way activates more 
trypsinogen and the other zymogens (Figure 10.34). Thus, the formation of 
trypsin by enteropeptidase is the master activation step. 

10.5.4 Some Proteolytic Enzymes Have Specific Inhibitors 

The conversion of a zymogen into a protease by cleavage of a single peptide 
bond is a precise means of switching on enzymatic activity. However, this 
activation step is irreversible, and so a different mechanism is needed to stop 
proteolysis. Specific protease inhibitors accomplish this task. For example, 
pancreatic trypsin inhibitor, a 6-kd protein, inhibits 
trypsin by binding very tightly to its active site. The dis- 
sociation constant of the complex is 0.1 pM, which cor- 
responds to a standard free energy of binding of about 
— 18 kcal moP^ ( —75 kj moP^). In contrast with nearly 
all known protein assemblies, this complex is not disso- 
ciated into its constituent chains by treatment with de- 
naturing agents such as 8 M urea or 6 M guanidine hy- 
drochloride. 

The reason for the exceptional stability of the com¬ 
plex is that pancreatic trypsin inhibitor is a very effective 
substrate analog. X-ray analyses showed that the in¬ 
hibitor lies in the active site of the enzyme, positioned 
such that the side chain of lysine 15 of this inhibitor in- 
teracts with the aspartate side chain in the specificity 
pocket of trypsin. In addition, there are many hydrogen 
bonds between the main chain of trypsin and that of its 
inhibitor. Furthermore, the carbonyl group of lysine 15 
and the surrounding atoms of the inhibitor fit snugly in 
the active site of the enzyme. Comparison of the struc- 
ture of the inhibitor bound to the enzyme with that of the 
free inhibitor reveals that the structure is essentially un- 
changed on binding to the enzyme (Figure 10.35). Thus, 
the inhibitor is preorganized into a structure that is highly 
complementary to the enzyme s active site. Indeed, the 



Trypsin-pancreatic trypsin Free pancreatic 

inhibitor complex trypsin inhibitor 


FIGURE 10.35 Interactíon of trypsin with its 
inhibitor. Structure of a complex of trypsin (yellow) and 
pancreatic trypsin inhibitor (red). Lysine 15 of the inhibitor 
penetrates into the active site of the enzyme and forms a sait 
bridge with aspartate 189 in the active site. The bound inhibitor 
and the free inhibitor are almost identical in structure. 






















-1 284 I- 

CHAPTER 10 • Regulatory Strategies: 

Enzymes and Hemoglobin 



FIGURE 10.36 Oxídation of methíoníne 
to the sulfoxíde. 


peptide bond between lysine 15 and alanine 16 in pancreatic trypsin in- 
hibitor is cleaved but at a very slow rate: the half-life of the trypsin—inhibitor 
complex is several months. In essence, the inhibitor is a substrate, but its 
intrinsic structure is so nicely complementary to the enzyme s active site 
that it binds very tightly and is turned over slowly. 

T Why does trypsin inhibitor exist? Indeed, the amount of trypsin is 
much greater that that of the inhibitor. Under what circumstances is 
it beneficiai to inhibit trypsin? Recall that trypsin activates other zymogens. 
Consequently, it is vital that even small amounts of trypsin be prevented 
from initiating the cascade prematurely. Trypsin molecules activated in the 
pancreas or pancreatic ducts could severely damage those tissues, leading 
to acute pancreatitis. Tissue necrosis may result from the activation of pro- 
teolytic enzymes (as well as prolipases) by trypsin, and hemorrhaging may 
result from its activation of elastase. We see here the physiological need for 
the tight binding of the inhibitor to trypsin. 

Pancreatic trypsin inhibitor is not the only important protease inhibitor. 
ai-Antitrypsin (also called ai-antiproteinase), a 53-kd plasma protein, pro- 
tects tissues from digestion by elastase, a secretory product of neutrophils 
(white blood cells that engulf bactéria). Antielastase would be a more accu- 
rate name for this inhibitor, because it blocks elastase much more effectively 
than it blocks trypsin. Like pancreatic trypsin inhibitor, ai-antitrypsin 
blocks the action of target enzymes by binding nearly irreversibly to their 
active sites. Genetic disorders leading to a deficiency of ai-antitrypsin show 
that this inhibitor is physiologically important. For example, the substitu- 
tion of lysine for glutamate at residue 53 in the type Z mutant slows the se- 
cretion of this inhibitor from liver cells. Serum leveis of the inhibitor are 
about 15% of normal in people homozygous for this defect. The consequence 
is that excess elastase destroys alveolar walls in the lungs by digesting elas- 
tic fibers and other connective-tissue proteins. 

The resulting clinicai condition is called emphysema (also known as de- 
structive lung disease). People with emphysema must breathe much harder 
than normal people to exchange the same volume of air, because their alve- 
oli are much less resilient than normal. Cigarette smoking markedly in- 
creases the likelihood that even a type Z heterozygote will develop emphy¬ 
sema. The reason is that smoke oxidizes methionine 358 of the inhibitor 
(Figure 10.36), a residue essential for binding elastase. Indeed, this me¬ 
thionine side chain is the bait that selectively traps elastase. The methionine 
sulfoxide oxidation product, in contrast, does not lure elastase, a striking 
consequence of the insertion of just one oxygen atom into a protein. We will 
consider another protease inhibitor, antithrombin III, when we examine the 
control of blood clotting. 


10.5.5 Blood Clotting Is Accomplished by a Cascade 
of Zymogen Activations 

Enzymatic cascades are often employed in biochemical systems to achieve a 
rapid response. In a cascade, an initial signal institutes a series of steps, each 
of which is catalyzed by an enzyme. At each step, the signal is amplified. 
For instance, if a signal molecule activates an enzyme that in turn activates 
10 enzymes and each of the 10 enzymes in turn activates 10 additional en¬ 
zymes, after four steps the original signal will have been amplified 10,000- 
fold. Blood clots are formed by a cascade of zymogen activations: the acti¬ 
vated form of one clotting factor catalyzes the activation of the next (Figure 
10.37). Thus, very small amounts of the initial factors suffice to trigger the 
cascade, ensuring a rapid response to trauma. 
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FIGURE 10.37 Blood-clottíng Cascade. A fibrin clot is formed by the interplay of the 
intrinsic, extrinsic, and final common pathways. The intrinsic pathway begins with the 
activation of factor XII (Hageman factor) by contact with abnormal surfaces produced by 
injury. The extrinsic pathway is triggered by trauma, which activates factor VII and releases a 
lipoprotein, called tissue factor, from blood vessels. Inactive forms of clotting factors are 
shown in red; their activated counterparts (indicated by the subscript "a") are in yellow. 
Stimulatory proteins that are not themselves enzymes are shown in blue. A striking feature 
of this process is that the activated form of one clotting factor catalyzes the activation of the 
next factor. 


There are two means of initiating clotting: the intrinsic and extrinsic 
pathways. The intrinsic clotting pathway is activated by exposure of anionic 
surfaces on rupture of the endothelial lining of the blood vessels. These 
surfaces serve as binding sites for factors in the clotting Cascade. Sub- 
stances that are released from tissues as a consequence of trauma to them 
trigger the extrinsic clotting pathway. The extrinsic and intrinsic pathways 
converge on a common sequence of final steps to form a clot composed 
of the protein fibrin. The two pathways interact with each other in vivo. 
Indeed, both are needed for proper clotting, as evidenced by clotting dis- 
orders caused by a deficiency of a single protein in one of the pathways. 
Note that the active forms of the clotting factors are designated with a sub¬ 
script ‘‘a.'' 

10.5.6 Fibrinogen Is Converted by Thrombin into a Fibrin Clot 

The best-characterized part of the clotting process is the final step in the 
Cascade: the conversion of fibrinogen into fibrin by thrombin, a proteolytic 
enzyme. Fibrinogen is made up of three globular units connected by two 
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FIGURE 10.38 Structure of a 
fíbrínogen molecule. (A) A ribbon 
diagram. The two rod regions are a-helical 
coiled coils, connected to a globular 
region at each end. (B) A schematic 
representation showing the positions of 
the fibrinopeptides A and B. 



rods (Figure 10.38). This 340-kd protein consists of six chains: two each of 
Aa, Bp, and 7 . The rod regions are triple-stranded a-helical coiled coils, a 
recurring motif in proteins. Thrombin cleaves four arginine-glycine peptide 
bonds in the central globular region of fibrinogen. On cleavage, an A pep¬ 
tide of 18 residues from each of the two Aa chains and a B peptide of 20 
residues from each of the two Bp chains are released from the globular re¬ 
gion. These A and B peptides are called fibrinopeptides. A fibrinogen mol¬ 
ecule devoid of these fibrinopeptides is called a fibrin monomer and has the 
subunit structure (aP 7 ) 2 . 

Fibrin monomers spontaneously assemble into ordered fibrous arrays 
called^brin. Electron micrographs and low-angle x-ray patterns show that 
fibrin has a periodic structure that repeats every 23 nm (Figure 10.39). 


FIGURE 10.39 Electron mícrograph of 
fibrin. The 23-nnn period along the fiber 
axis is half the length of a fibrinogen 
molecule. [Courtesy of Dr. Henry Slayter.] 



Higher-resolution images reveal the detailed structure of the fibrin 
monomer, how the fibrin monomers come together, and how this assembly 
is facilitated by remo vai of the fibrinopeptides. The homologous P and 7 
chains have globular domains at the carboxyl-terminal ends (Figure 10.40). 
These domains have binding ‘Toles’' that interact with peptides. The P do- 
main is specific for sequences of the form H 3 N^-Gly-His-Arg-, whereas 
the 7 domain binds H 3 N^-Gly-Pro-Arg-. Exactly these sequences (some- 


FIGURE 10.40 Formation of a fibrin 
clot. (1) Thrombin cleaves fibrinopeptides 
A and B from the central globule of 
fibrinogen. (2) Globular domains at the 
carboxyl-terminal ends of the p and 7 
chains interact with "knobs" exposed at 
the amino-terminal ends of the p and a 
chains to form clots. 
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times called ‘‘knobs'') are exposed at the amino-terminal ends of the (3 and 
a chains, respectively, on thrombin cleavage. The knobs of the a subunits 
fit into the holes on the 7 subunits of another monomer to form a protofi- 
bril. This protofibril is extended when the knobs of the |3 subunits fit into 
holes of P subunits of other protofibrils. Thus, analogous to the activation 
of chymotrypsinogen, peptide-bond cleavage exposes new amino termini 
that can participate in specific interactions. The newly formed clot is stabi- 
lized by the formation of amide bonds between the side chains of lysine and 
glutamine residues in different monomers. 



This cross-linking reaction is catalyzed by transglutaminase (factor XIIIJ, 
which itself is activated from the protransglutaminase form by thrombin. 


12.5.7 Prothrombin Is Readied for Activation by a Vitamin 
K-Dependent Modification 

Thrombin is synthesized as a zymogen cdlled prothrombin, which comprises 
four major domains, with the serine protease domain at its carboxyl termi- 
nus. The first domain is called a gla domain, whereas domains 2 and 3 are 
called kringle domains (Figure 10.41). These domains work in concert to keep 
prothrombin in an inactive form and to target it to appropriate sites for its 
activation by factor (a serine protease) and factor Va (a stimulatory pro- 
tein). Activation is accomplished by proteolytic cleavage of the bond be¬ 
tween arginine 274 and threonine 275 to release a fragment containing the 
first three domains and by cleavage of the bond between arginine 323 and 
isoleucine 324 (analogous to the key bond in chymotrypsinogen) to yield ac¬ 
tive thrombin. 
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FIGURE 10.41 Modular structure of prothrombin. Cleavage of two peptide bonds 
yields thrombin. All the 7 -carboxyglutamate residues are in the gla domain. 

T Vitamin K (Section 8.6.2 and Figure 10.42) has been known for 
many years to be essential for the synthesis of prothrombin and sev- 
eral other clotting factors. The results of studies of the abnormal pro¬ 
thrombin synthesized in the absence of vitamin K or in the presence of vi¬ 
tamin K antagonists, such as dicoumarol, revealed the mode of action of this 
vitamin. Dicoumarol is found in spoiled sweet clover and causes a fatal 
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FIGURE 10.42 Structures of vitamin K 
and two antagonists, dicoumarol and 
warfarin. 
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FIGURE 10.43 The caldum-bindíng 
regíon of prothrombín. Prothrombin 
binds calcium ions with the modified 
amino acid 7 -carboxyglutannate (red). 


I- 

An account of a hemorrhagic 
disposition existing in certain families— 

"About seventy or eighty years ago, a 
woman by the name of Smith settled 
in the vicinity of Plymouth, New Hamp- 
shire, and transmitted the following 
idiosyncrasy to her descendants. It is 
one, she observed, to which her family 
is unfortunately subject and has been 
the source not only of great solicitude, 
but frequentiy the cause of death. If 
the least scratch is made on the skin of 
some of them, as mortal a hemorrhage 
wili eventually ensue as if the largest 
wound is inflicted.... It is a surprising 
circumstance that the males only are 
subject to this strange affection, and 
that all of them are not liable to it.... 
Although the females are exempt, they 
are stili capable of transmitting it to 
their male children." 

John Otto (1803) 


hemorrhagic disease in cattle fed on this hay. This coumarin derivative is 
used clinically as an anticoagulant to prevent thromboses in patients prone 
to clot formation. Dicoumarol and such related vitamin K antagonists as 
warfarin also serve as effective rat poisons. Cows fed dicoumarol synthesize 
an abnormal prothrombin that does not bind Ca^^, in contrast with normal 
prothrombin. This difference was puzzling for some time because abnor¬ 
mal prothrombin has the same number of amino acid residues and gives the 
same amino acid analysis after acid hydrolysis as does normal prothrombin. 

Nuclear magnetic resonance studies revealed that 
normal prothrombin contains y-carboxyglutamate, a 
previously unknown residue that evaded detection 
because its second carboxyl group is lost on acid hy¬ 
drolysis. The abnormal prothrombin formed subse- 
quent to the administration of anticoagulants lacks 
this modified amino acid. In fact, the first 10 gluta- 
mate residues in the amino-terminal region of pro¬ 
thrombin are carboxylated to 7 -carboxyglutamate 
by a vitamin K-dependent enzyme system (Figure 
10.43). The vitamin K-dependent carboxylation reaction converts glutamate, 
a weak chelator of into y-carboxyglutamate, a much stronger chelator. 

Prothrombin is thus able to bind Ca^^, but what is the effect of this bind- 
ing? The binding of Ca^^ by prothrombin anchors the zymogen to phos- 
pholipid membranes derived from blood platelets after injury. The binding 
of prothrombin to phospholipid surfaces is crucial because it brings pro¬ 
thrombin into dose proximity to two clotting proteins that catalyze its con- 
version into thrombin. The proteolytic activation of prothrombin removes 
the calcium-binding domain and frees thrombin from the membrane so that 
it can cleave fibrinogen and other targets. 

10.5.8 Hemophilia Revealed an Early Step in Clotting 

7 Some important breakthroughs in the elucidation of clotting path 
ways have come from studies of patients with bleeding disorders. 
Classic hemophilia, or hemophilia A, the best-known clotting defect, is ge- 
netically transmitted as a sex-linked recessive characteristic. In classic he¬ 
mophilia, factor VIII (antihemophilic factor) of the intrinsic pathway is miss- 
ing or has markedly reduced activity. Although factor VIII is not itself a 
protease, it markedly stimulates the activation of factor X, the final protease 
of the intrinsic pathway, by factor IX^, a serine protease (Figure 10.44). 
Thus, activation of the intrinsic pathway is severely impaired in hemophilia. 

FIGURE 10.44 Action of antihemophilic factor. 

Antihemophilic factor (VIII) stimulates the activation of factor 
X by factor IXg. It is interesting to note that the activity of 
factor VIII is markedly increased by limited proteolysis by 
thrombin and factor Xg. This positive feedback amplifies the 
clotting signal and accelerates clot formation after a threshold 
has been reached. 

In the past, hemophiliacs were treated with transfusions of a concen- 
trated plasma fraction containing factor VIII. This therapy carried the risk 
of infection. Indeed, many hemophiliacs contracted hepatitis and AIDS. A 
safer preparation of factor VIII was urgently needed. With the use of bio- 
chemical purification and recombinant DNA techniques, the gene for fac¬ 
tor VIII was isolated and expressed in cells grown in culture. Recombinant 
factor VIII purified from these cells has largely replaced plasma concen- 
trates in treating hemophilia. 


Antihemophilic factor 
(VIII) 


0 H''' 


/ Proteolysis 



O 

7-Carboxyglutamate residue 









10.5.9 The Clotting Process Must Be Precisely Regulated 

There is a fine line between hemorrhage and thrombosis. Clots must 
^ form rapidly yet remain confined to the area of injury. What are the 
mechanisms that normally limit clot formation to the site of injury? The la- 
bility of clotting factors contributes significantly to the control of clotting. Ac- 
tivated factors are short-lived because they are diluted by blood flow, removed 
by the liver, and degraded by proteases. For example, the stimulatory pro- 
teins factors Va and VlIIa are digested by protein C, a protease that is switched 
on by the action of thrombin. Thus, thrombin has a dual function: it catalyzes 
the formation of fibrin and it initiates the deactivation of the clotting cascade. 

Specific inhibitors of clotting factors are also criticai in the termination 
of clotting. The most important one is antithrombin IIf a plasma protein 
that inactivates thrombin by forming an irreversible complex with it. An¬ 
tithrombin III resembles a^-antitrypsin except that it inhibits thrombin 
much more strongly than it inhibits elastase. Antithrombin III also blocks 
other serine proteases in the clotting cascade—namely, factors Xlla, Xla, IXa, 
and Xa. The inhibitory action of antithrombin III is enhanced by heparin, a 
negatively charged polysaccharide found in mast cells near the walls of blood 
vessels and on the surfaces of endothelial cells (Figure 10.45). Heparin acts 
as an anticoagulant by increasing the rate of formation of irreversible com- 
plexes between antithrombin III and the serine protease clotting factors. An- 
titrypsin and antithrombin are serpins, a family of serine protease inhibitors. 

The importance of the ratio of thrombin to antithrombin is illustrated 
in the case in which a 14-year-old boy died of a bleeding disorder because 
of a mutation in his -antitrypsin, which normally inhibits elastase (Section 
10.5.4). Methionine 358 in ai-antitrypsin s binding pocket for elastase was 
replaced by arginine, resulting in a change in specificity from an elastase in- 
hibitor to a thrombin inhibitor. a^-Antitrypsin activity normally increases 
markedly after injury to counteract excess elastase arising from stimulated 
neutrophils. The mutant ai-antitrypsin caused the patient s thrombin ac¬ 
tivity to drop to such a low levei that hemorrhage ensued. We see here a 
striking example of how a change of a single residue in a protein can drama- 
tically alter specificity and an example of the criticai importance of having the 
right amount of a protease inhibitor. 

Antithrombin limits the extent of clot formation, but what happens to the 
clots themselves? Clots are not permanent structures but are designed to be 
dissolved when the structural integrity of damaged areas is restored. Fibrin is 
split hy plasmin, a serine protease that hydrolyzes peptide bonds in the coiled- 
coil regions. Plasmin molecules can diffuse through aqueous channels in the 
porous fibrin clot to cut the accessible connector rods. Plasmin is formed by 
proteolytic activation of plasminogen, an inactive precursor that has a high 
affinity for the fibrin clots. This conversion is carried out by tissue-type plas¬ 
minogen activator (TPA), a 72-kd protein that has a domain structure closely 
related to that of prothrombin (Figure 10.46). 


Fibrin binding 

Kringle 

Kringle 


Serine protease 


FIGURE 10.46 Modular structure of tissue-type plasminogen activator (TPA). 


However, a domain that targets TPA to fibrin clots replaces the membrane- 
targeting gla domain of prothrombin. The TPA bound to fibrin clots swiftly 
activates adhering plasminogen. In contrast, TPA activates free plasminogen 
very slowly. The gene for TPA has been cloned and expressed in cultured mam- 
malian cells. The results of clinicai studies have shown that TPA administered 
intravenously within an hour of the formation of a blood clot in a coronary ar- 
tery markedly increases the likelihood of surviving a heart attack (Figure 10.47). 



FIGURE 10.45 Electron micrograph of 
a mast cell. Heparin and other molecules 
in the dense granules are released into the 
extracellular space when the cell is 
triggered to secrete. [Courtesy of Lynne Mercer.] 
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FIGURE 10.47 The effect of tissue- 
type plasminogen factor. TPA leads to 
the dissolution of blood clots, as shown 
by x-ray images of blood vessels in the 
heart (A) before and (B) 3 hours after 
the administration of TPA. The position of 
the clot is marked by the arrow in part A. 
[After F. Van de Werf, P. A. Ludbrook, S. R. 
Bergmann, A. J. Tiefenbrunn, K. A. A. Fox, H. de 
Geest, M. Verstraete, D. Collen, and B. E. Sobel. 
New Engl. J. Med. 310(1984):609.] 
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Enzymes and Hemoglobin Aspartate Transcarbamoylase Is Allosterically Inhibited by the End 

Product of Its Pathway 

Allosteric proteins constitute an important class of proteins whose bio- 
logical activity can be regulated. Specific regulatory molecules can mod- 
ulate the activity of allosteric proteins by binding to distinct regulatory 
sites, separate from the functional site. These proteins have multiple 
functional sites, which display cooperation as evidenced by a sigmoidal 
dependence of function on substrate concentration. Aspartate transcar¬ 
bamoylase (ATCase), one of the best-understood allosteric enzymes, 
catalyzes the synthesis of N-carbamoylaspartate, the first intermediate 
in the synthesis of pyrimidines. ATCase is feedback inhibited by cyti- 
dine triphosphate(CTP), the final product of the pathway. ATP re- 
verses this inhibition. ATCase consists of separable catalytic ( 03 ) sub- 
units (which bind the substrates) and regulatory (r 2 ) subunits (which 
bind CTP and ATP). The inhibitory effect of CTP, the stimulatory ac- 
tion of ATP, and the cooperative binding of substrates are mediated by 
large changes in quaternary structure. On binding substrates, the C 3 
subunits of the Cere enzyme move apart and reorient themselves. This 
allosteric transition is highly concerted, as postulated by the Monod- 
Wyman-Changeux (MWC) model. All subunits of an ATCase mole- 
cule simultaneously interconvert from the T (low-affinity) to the R 
(high-affinity) State. 

o Hemoglobin Transports Oxygen Efficientiy by Binding Oxygen 
Cooperatively 

Hemoglobin, the oxygen carrier in the blood, is an allosteric protein. 
Hemoglobin consists of four polypeptide chains, each with a heme 
group—a substituted porphyrin with a central iron. Hemoglobin A, the 
predominant hemoglobin in adults, has the subunit structure a 2 | 32 . He¬ 
moglobin transports H^ and CO 2 in addition to O 2 . Hemoglobin ex- 
hibits three kinds of allosteric effects. First, the oxygen-binding curve 
of hemoglobin is sigmoidal, which indicates that the binding of oxygen 
is cooperative. The binding of oxygen to one heme group facilitates the 
binding of oxygen to the other heme groups in the same molecule. Sec- 
ond, the binding of H^ and CO 2 promotes the release of O 2 from he¬ 
moglobin, an effect that is physiologically important in enhancing the 
release of O 2 in metabolically active tissues such as muscle. These al¬ 
losteric linkages between the binding of H^, CO 2 , and O 2 are known 
as the Bohr effect. Third, the affinity of hemoglobin for O 2 is further 
regulated by 2,3-bisphosphoglycerate (2,3-BPG), a small molecule with 
a very high density of negative charge. 2,3-Bisphosphoglycerate binds 
tightly to deoxyhemoglobin but not to oxyhemoglobin. Hence, 2,3- 
BPG lowers the oxygen affinity of hemoglobin. Fetal hemoglobin {oi 2 "Í 2 ) 
has a higher oxygen affinity than human adult hemoglobin because fe¬ 
tal hemoglobin binds 2,3-BPG less tightly. Neither the sequential nor 
the concerted model completely describes the allosteric behavior of he¬ 
moglobin. Rather, the behavior of hemoglobin is best described by a 
combined model that employs features of both models. 

Isozymes Provide a Means of Regulation Specific to Distinct Tissues and 
Developmental Stages 

Isozymes differ in structural characteristics but catalyze the same reac- 
tion. They provide a means of fine-tuning metabolism to meet the needs 
of a given tissue or developmental stage. The results of gene-duplication 
events provide the means for subtle regulation of enzyme function. 
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Covalent modification of proteins is a potent means of controlling the Selected Readings 

activity of enzymes and other proteins. Phosphorylation is the most 

common type of reversible covalent modification. Signals can be highly 

amplified by phosphorylation because a single kinase can act on many 

target molecules. The regulatory actions of protein kinases are reversed 

by protein phosphatases, which catalyze the hydrolysis of attached 

phosphoryl groups. 

Cyclic AMP serves as an intracellular messenger in the transduction 
of many hormonal and sensory stimuli. Cyclic AMP switches on pro¬ 
tein kinase A (PKA), a major multifunctional kinase, by binding to the 
regulatory subunit of the enzyme, thereby releasing the active catalytic 
subunits of PKA. In the absence of cAMP, the catalytic sites of PKA 
are occupied by pseudosubstrate sequences of the regulatory subunit. 

• Many Enzymes Are Activated by Specific Proteolytic Cleavage 

The activation of an enzyme by proteolytic cleavage of one or a few pep- 
tide bonds is a recurring control mechanism seen in processes as diverse 
as the activation of digestive enzymes and blood clotting. The inactive 
precursor is called a zymogen (or a proenzyme). Trypsinogen is acti¬ 
vated by enteropeptidase or trypsin, and trypsin then activates a host 
of other zymogens, leading to the digestion of foodstuffs. For instance, 
trypsin converts chymotrypsinogen, a zymogen, into active chymo- 
trypsin by hydrolyzing a single peptide bond. 

A striking feature of the clotting process is that it is accomplished 
by a Cascade of zymogen conversions, in which the activated form of 
one clotting factor catalyzes the activation of the next precursor. Many 
of the activated clotting factors are serine proteases. In the final step of 
clot formation, fibrinogen, a highly soluble molecule in the plasma, is 
converted by thrombin into fibrin by the hydrolysis of four argi- 
nine-glycine bonds. The resulting fibrin monomer spontaneously 
forms long, insoluble fibers called fibrin. Zymogen activation is also es- 
sential in the lysis of clots. Plasminogen is converted into plasmin, a ser¬ 
ine protease that cleaves fibrin, by tissue-type plasminogen activator 
(TPA). Although zymogen activation is irreversible, specific inhibitors 
of some proteases exert control. The irreversible protein inhibitor an- 
tithrombin III holds blood clotting in check in the clotting cascade. 
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-1 PROBLEMS I- 

1. Activity profile. A histidine residue in the active site of as- 
partate transcarbamoylase is thought to be important in stabi- 
lizing the transition State of the bound substrates. Predict the pH 
dependence of the catalytic rate, assuming that this interaction 
is essential and dominates the pH-activity profile of the enzyme. 

2. Allosteric switching. A substrate binds 100 times as tightly 
to the R State of an allosteric enzyme as to its T State. Assume 
that the concerted (MWC) model applies to this enzyme. 

(a) By what factor does the binding of one substrate molecule 
per enzyme molecule alter the ratio of the concentrations of en¬ 
zyme molecules in the R and T States? 

(b) Suppose that L, the ratio of [T] to [R] in the absence of sub¬ 
strate, is 10^ and that the enzyme contains four binding sites for 
substrate. What is the ratio of enzyme molecules in the R State 
to that in the T State in the presence of saturating amounts of 
substrate, assuming that the concerted model is obeyed? 

3. Allosteric transition. Consider an allosteric protein that obeys 

the concerted model. Suppose that the ratio of T to R formed in 
the absence of ligand is 10^, = 2 mM, and Rr = 5 pM. The 

protein contains four binding sites for ligand. What is the fraction 
of molecules in the R form when 0,1,2,3, and 4 ligands are bound? 

4. Negative cooperativity. You have isolated a dimeric enzyme 
that contains two identical active sites. The binding of substrate 
to one active site decreases the substrate affinity of the other ac¬ 
tive site. Which allosteric model best aceounts for this negative 
cooperativity? 

5. Paradoxical atfirst glance. Recall that phosphonacetyl-L-as- 
partate (PALA) is a potent inhibitor of ATCase because it mim- 
ies the two physiological substrates. However, low concentra¬ 
tions of this unreactive bisubstrate analog increase the reaction 
velocity. On the addition of PALA, the reaction rate increases 
until an average of three molecules of PALA are bound per mol¬ 
ecule of enzyme. This maximal velocity is 17-fold as great as it 
is in the absence of PALA. The reaction rate then decreases to 
nearly zero on the addition of three more molecules of PALA 
per molecule of enzyme. Why do low concentrations of PALA 
activate ATCase? 



PALA/ATCase (mole ratio) 


6. R versus T. An allosteric enzyme that follows the concerted 
mechanism has a T/R ratio of 300 in the absence of substrate. 
Suppose that a mutation reversed the ratio. How would this mu- 
tation affect the relation between the rate of the reaction and sub¬ 
strate concentration? 

7. Tuning oxygen affinity. What is the effect of each of the folio w- 
ing treatments on the oxygen affinity of hemoglobin A in vitro? 

(a) Increase in pH from 7.2 to 7.4. 

(b) Increase in pC02 from 10 to 40 torr. 

(c) Increase in [2,3-BPG] from 2 X 10~"^ to 8 X 10~^ M. 

(d) Dissociation of a2|32 into monomeric subunits. 

8. Avian and reptilian counterparts. The erythrocytes of birds 
and turtles contain a regulatory molecule different from 2,3- 
BPG. This substance is also effective in reducing the oxygen 
affinity of human hemoglobin stripped of 2,3-BPG. Which of 
the following substances would you predict to be most effective 
in this regard? 

(a) Glucose 6-phosphate (d) Malonate 

(b) Inositol hexaphosphate (e) Arginine 

(c) HP04^~ (f) Lactate 

9. Tuning proton affinity. The pK of an acid depends partly on 

its environment. Predict the effect of the following environmen- 
tal changes on the pK of a glutamic acid side chain. 

(a) A lysine side chain is brought into dose proximity. 

(b) The terminal carboxyl group of the protein is brought into 
dose proximity. 

(c) The glutamic acid side chain is shifted from the outside of 
the protein to a nonpolar site inside. 

10. Zymogen activation. When very low concentrations of 
pepsinogen are added to acidic media, how does the half-time 
for activation depend on zymogen concentration? 

11. A revealing assay. Suppose that you have just examined a 
young boy with a bleeding disorder highly suggestive of classic 
hemophilia (factor VIII deficiency). Because of the late hour, the 
laboratory that carries out specialized coagulation assays is 
closed. However, you happen to have a sample of blood from a 
classic hemophiliac whom you admitted to the hospital an hour 
earlier. What is the simplest and most rapid test that you can 
perform to determine whether your present patient also is defi- 
cient in factor VIII activity? 

12. Counterpoint. The synthesis of factor X, like that of pro- 
thrombin, requires vitamin K. Factor X also contains y-car- 
boxyglutamate residues in its amino-terminal region. However, 
activated factor X, in contrast with thrombin, retains this region 
of the molecule. What is a likely functional consequence of this 
difference between the two activated species? 

13. A discerning inhibitor. Antithrombin III forms an irre- 
versible complex with thrombin but not with prothrombin. 
What is the most likely reason for this difference in reactivity? 


Effect of PALA on ATCase rate. 
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14. Repeating heptads. Each of the three types of fibrin chains 
contains repeating heptapeptide units (abcdefg) in which residues 
a and d are hydrophobic. Propose a reason for this regularity. 

15. Drug design. A drug company has decided to use recombinant 
DNA methods to prepare a modified a^-antitrypsin that will be 
more resistant to oxidation than is the naturally occurring inhibitor. 
Which single amino acid substitution would you recommend? 

Data Interpretation Problems 

16. Distinguishing hetween models. The graph below shows the 
fraction of an allosteric enzyme in the R State (/r) and the frac- 
tion of active sites bound to substrate (Y) as a function of sub- 
strate concentration. Which model, the concerted or sequential, 
best explains these results? 



[From M. W. Kirschner and H. K. Schachman, Biochemistry 12 (1966):2997.] 

17. Reporting live from ATCase. ATCase was reacted with 
nitromethane to form a colored nitrotyrosine group (X^ax “ 
430 nm) in each of its catalytic chains. The absorption by this 
repórter group depends on its immediate environment. An es- 
sential lysine residue at each catalytic site also was modified to 
block the binding of substrate. Catalytic trimers from this dou- 
bly modified enzyme were then combined with native trimers to 
form a hybrid enzyme. The absorption by the nitrotyrosine group 
was measured on addition of the substrate analog succinate. 
What is the significance of the alteration in the absorbance at 
430 nm? 



[After H. K. Schachman. 7. Biol. Chem. 263 (1988):18583] 

The binding of succinate, an unreactive analog of aspartate, to 
one trimer changed the visible absorption spectrum of nitroty¬ 
rosine residues attached to the other trimer. 


18. Reporting live from ATCase 2. A different ATCase hybrid 
was constructed to test the effects of allosteric activators and in- 
hibitors. Normal regulatory subunits were combined with ni¬ 
tro tyr o sine-containing catalytic subunits. The addition of ATP 
in the absence of substrate increased the absorbance at 430 nm, 
the same change elicited by the addition of succinate (see the 
graph in Problem 17). Conversely, CTP in the absence of sub¬ 
strate decreased the absorbance at 430 nm. What is the signifi¬ 
cance of the changes in absorption of the repórter groups?. 



Wavelength (nm) 

[After H. K. Schachman. J. Biol. Chem. 263(1988):18583.] 

Chapter Integration Problem 

19. Sticky patches. The substitution of valine for glutamate at 
position 6 of the |3 chains of hemoglobin places a nonpolar 
residue on the outside of hemoglobin S, the version of hemoglo¬ 
bin responsible for sickle-cell anemia. The oxygen affinity and 
allosteric properties of hemoglobin are virtually unaffected by 
this change. However, the valine side chain of hemoglobin S in- 
teracts with a complementary sticky patch (formed by phenyl- 
alanine |385 and leucine |388) on another hemoglobin mole- 
cule—a patch that is exposed in deoxygenated but not in 
oxygenated hemoglobin. What is the chemical basis for the in- 
teraction between the hemoglobin molecules? What would be 
the effect of this interaction? 

Mechanism Problems 

20. Aspartate transcarbamoylase. Write the mechanism (in de- 
tail) for the conversion of aspartate and carbamoyl phosphate 
into N-carbamoylaspartate. Include a role for the residue histi- 
dine present in the active site. 

21. Protein kinases. Write a mechanism (in detail) for the phos- 
phorylation of a serine residue by ATP catalyzed by a protein 
kinase. What groups might you expect to find in the enzyme ac¬ 
tive site? 

Media Problem 

22. Cooperative, more or less. (a) Suppose you purify the same 
enzyme from two different species and find that one is a tetramer 
(n = 4) and the other a hexamer (n = 6). All other things being 
equal, which would you expect would show the greater cooper- 
ativity? (b) To your surprise, both enzymes show similar coop- 
erativity. Explain how differences in the MWC parameter c 
could explain this. (Hint: Make sure you do the exercise in Sec- 
tion 4 of the Conceptual Insights module on cooperative bind¬ 
ing and kinetics.) 
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Carbohydrates 




Carbohydrates ín food are important sources of energy. Starch, found in plant- 
derived food such as pasta, consists of chains of linked glucose molecules. These 
chains are broken down into individual glucose molecules for eventual use in 
generation of ATP and building blocks for other molecules. [(Left) Superstock.] 


Let us take an overview of carbohydrates, one of the four major classes of 
biomolecules along with proteins, nucleic acids, and lipids. Carbohydrates 
are aldehyde or ketone compounds with multiple hydroxyl groups. They 
make up most of the organic matter on Earth because of their extensive roles 
in all forms of life. First, carbohydrates serve as energy Stores, fuels, and 
metabolic intermediates. Second, ribose and deoxyribose sugars form part of 
the structural framework of RNA and DNA. Third, polysaccharides are 
structural elements in the cell walls of bactéria and plants. In fact, cellulose, 
the main constituent of plant cell walls, is one of the most 
abundant organic compounds in the biosphere. Fourth, 
carbohydrates are linked to many proteins and lipids, where 
they play key roles in mediating interactions among cells 
and interactions between cells and other elements in the cel- 
lular environment. 

A key related property of carbohydrates in their role as 
mediators of cellular interactions is the tremendous struc¬ 
tural diversity possible within this class of molecules. Car¬ 
bohydrates are built from monosaccharides, small mole¬ 
cules that typically contain from three to nine carbon atoms 
and vary in size and in the stereochemical configuration at 
one or more carbon centers. These monosaccharides may 
be linked together to form a large variety of oligosaccharide 
structures. The unraveling of these oligosaccharide struc- 
tures, the discovery of their placement at specific sites 
within proteins, and the determination of their function are 
tremendous challenges in the field of proteomics. 


OUTLINE 


11.1 Monosaccharides Are Aldehydes 
or Ketones with Multiple Hydroxyl 
Groups 

11.2 Complex Carbohydrates Are 
Formed by Linkage of Monosaccharides 

11.3 Carbohydrates Can be Attached 
to Proteins to Form Glycoproteins 

o 11.4 Lectins Are Specific Carbohydrate- 
Binding Proteins 



























CHAPTER 11 • Carbohydrates 


11.1 MONOSACCHARIDES ARE ALDEHYDES OR KETONES 
WITH MULTIPLE HYDROXYL GROUPS 


Monosaccharides, the simplest carbohydrates, are aldehydes or ketones that 
have two or more hydroxyl groups; the empirical formula of many is 
(C-H20)^, literally a ‘‘carbon hydrate/' Monosaccharides are important 
fuel molecules as well as building blocks for nucleic acids. The small- 
est monosaccharides, for which n = 3, are dihydroxyacetone and D- and 
L - gly cer aldehy de. 


HO 

y.»2 


V" 

o=c 

H"" \ 

HO \ 

,CH2 

,CH2 

/CH2 

/ 

/ 

/ 

HO 

HO 

HO 

Dihydroxyacetone 

D-Glyceraldehyde 

L-Glyceraldehyde 

(a ketose) 

(an aldose) 

(an aldose) 



They are referred to as trioses (tri- for 3). Dihydroxyacetone is called a 
ketose because it contains a keto group, whereas glyceraldehyde is called an 
aldose because it contains an aldehy de group. Glyceraldehyde has a sin¬ 
gle asymmetric carbon and, thus, there are two stereoisomers of this sugar. 
D-Glyceraldehyde and L- glyceraldehyde are enantiomers, or mirror images 
of each other. As mentioned in Chapter 3, the prefixes D and L designate 
the absolute configuration. Monosaccharides and other sugars will often be 
represented in this book by Fischer projections (Figure 11.1). Recall that, in 
a Fischer projection of a molecule, atoms joined to an asymmetric carbon 
atom by horizontal bonds are in front of the plane of the page, and those 
joined by vertical bonds are behind (see the Appendix in Chapter 1). 
Fischer projections are useful for depicting carbohydrate structures be¬ 
cause they provide clear and simple views of the stereochemistry at 
each carbon center. 
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Dihydroxyacetone 


FIGURE 11.1 Fischer projections of trioses. The top structure reveals the stereochemical 
relations assumed for Fischer projections. 


Simple monosaccharides with four, five, six, and seven carbon atoms are 
called tetroses, pentoses, hexoses, and heptoses, respectively. Because these 
molecules have multiple asymmetric carbons, they exist as diastereoisomers, 
isomers that are not mirror images of each other, as well as enantiomers. In 
regard to these monosaccharides, the symbols D and L designate the ab- 
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FIGURE 11.2 D-Aldoses containing three. 

four, five. 

and six carbon atoms. 






D-Aldoses contain an aldehyde group (shown in blue) and have the absolute configuration 
of D-glyceraldehyde at the asymmetric center (shown in red) farthest from the aldehyde 
group. The numbers indicate the standard designations for each carbon atom. 


solute configuration of the asymmetric carbon farthest from the aldehyde 
or keto group. Figure 11.2 shows the common D-aldose sugars. D-Ribose, 
the carbohydrate component of RNA, is a five-carbon aldose. D-Glucose, 
D-mannose, and D-galactose are abundant six-carbon aldoses. Note that 
D-glucose and D-mannose differ in configuration only at C-2. Sugars dif- 
fering in configuration at a single asymmetric center are called epimers. 
Thus, D-glucose and D-mannose are epimeric at C-2; D-glucose and 
D-galactose are epimeric at C-4. 

Dihydroxyacetone is the simplest ketose. The stereochemical relation 
between D-ketoses containing as many as six carbon atoms are shown in 
Figure 11.3. Note that ketoses have one fewer asymmetric center than do 
aldoses with the same number of carbons. D-Fructose is the most abundant 
ketohexose. 
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The keto group is shown in blue. The 
asymmetric center farthest from the keto 

5 

H-C—OH 

1 

H-C—OH 

1 

H-C^OH 
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group, which determines the d designation. 

6 

CH 2 OH 

CH 2 OH 

CH 2 OH 

CH 2 OH 

is shown in red. 


D-Psicose 

D-Fructose 

D-Sorbose 

D-Tagatose 


11.1.1 Pentoses and Hexoses Cyclize to Form Furanose and 
Pyranose Rings 

The predominant forms of ribose, glucose, fructose, and many other sug- 
ars in solution are not open chains. Rather, the open-chain forms of these 
sugars cyclize into rings. In general, an aldehyde can react with an alcohol 
to form a hemiacetal. 



Furan 



Pyran 


O 

II 

+ HOR' 

Aldehyde Alcohol 

For an aldohexose such as glucose, the C-1 aldehyde in the open-chain form 
of glucose reacts with the C-5 hydroxyl group to form an intramolecular 
hemiacetal The resulting cyclic hemiacetal, a six-membered ring, is called 
pyranose because of its similarity to pyran (Figure 11.4). Similarly, a ketone 
can react with an alcohol to form a hemiketal 

O HO OR" 

II \/ 

X + HOR" T-- X. 

^R' R^ ^R' 

Ketone Alcohol Hemiketal 

The C-2 keto group in the open-chain form of a ketohexose, such as fruc¬ 
tose, can form an intramolecular hemiketal by reacting with either the 



Hemiacetal 
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CH2OH 



D-Glucose 

(open-chain form) 



H OH 

a-D-Glucopyranose 


CH2OH 



H OH 
p-D-Glucopyranose 


C-6 hydroxyl group to form a six-membered cyclic hemiketal or the C-5 
hydroxyl group to form a five-membered cyclic hemiketal (Figure 11.5). The 
five-membered ring is called difuranose because of its similarity to furan. 


0. XH20H 

HO-C—H 

HOH2C 

H^ 

H^ 

.OH 

. CH2OH 

H-C—OH 

/ 



H-C— OH 

1 

HOl 

/ 

H 

\ 

1 

CH2OH 

D-Fructose 




(open-chain form) 


HOH2C CH2OH 

. 0 . 



OH H 


a-D-Fructofuranose 

(a cyclic form of fructose) 


The depictions of glucopyranose and fructofuranose shown in Figures 
11.4 and 11.5 are Haworth projections. In such projections, the carbon atoms 
in the ring are not explicitly shown. The approximate plane of the ring is 
perpendicular to the plane of the paper, with the heavy line on the ring pro- 
jecting toward the reader. Like Fischer projections, Haworth projections al- 
low easy depiction of the stereochemistry of sugars. We will return to a more 
structurally realistic view of the conformations of cyclic monosaccharides 
shortly. 

An additional asymmetric center is created when a cyclic hemiacetal is 
formed. In glucose, C-1, the carbonyl carbon atom in the open-chain form, 
becomes an asymmetric center. Thus, two ring structures can be formed: 
o: -D -glucopyranose and j8 -D- glucopyranose (see Figure 11.4). For D sugars 
drawn as Haworth projections, the designation a means that the hydroxyl 
group attached to C-1 is below the plane of the ring; j8 means that it is above 
the plane of the ring. The C-1 carbon atom is called the anomeric carbon 
atom, and the a and j8 forms are called anomers. An equilibrium mixture of 
glucose contains approximately one-third a anomer, two-thirds j8 anomer, 
and <1% of the open-chain form. 

The same nomenclature applies to the furanose ring form of fructose, 
except that a and (3 refer to the hydroxyl groups attached to C-2, the 
anomeric carbon atom (see Figure 11.5). Fructose forms both pyranose and 
furanose rings. The pyranose form predominates in fructose free in solu- 
tion, and the furanose form predominates in many fructose derivatives 
(Figure 11.6). Pentoses such as D-ribose and 2-deoxy-D-ribose form fura¬ 
nose rings, as we have seen in the structure of these units in RNA and DNA. 
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FIGURE 11.4 Pyranose formation. The 

open-chain form of glucose cyclizes when 
the C -5 hydroxyl group attacks the oxygen 
atom of the C -1 aldehyde group to form 
an intramolecular hemiacetal. Two anomeric 
forms, designated a and jS, can result. 


FIGURE 11.5 Furanose formation. The 

open-chain form of fructose cyclizes to a 
five-membered ring when the C -5 hydroxyl 
group attacks the C -2 ketone to form an 
intramolecular hemiketal. Two anomers are 
possible, but only the a anomer is shown. 
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FIGURE 11.6 Ríng structures of 
fructose. Fructose can form both five- 
membered furanose and six-membered 
pyranose rings. In each case, both a and 
jS anomers are possible. 


Steric hindrance 



C3-endo C2-endo 


FIGURE 11.8 Envelope conformations 

of jS-D-ríbose. The C2-endo and Cs-endo 
forms of jS-D-ribose are shown. The color 
indicates the four atoms that lie approxi- 
mately in a plane. 
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11.1.2 Conformation of Pyranose and Furanose Rings 

The six-membered pyranose ring is not planar, because of the tetrahedral 
geometry of its saturated carbon atoms. Instead, pyranose rings adopt two 
classes of conformations, termed chair and hoat because of the resemblance 
to these objects (Figure 11.7). In the chair form, the substituents on the ring 
carbon atoms have two orientations: axial and equatorial Axial bonds are 
nearly perpendicular to the average plane of the ring, whereas equatorial 
bonds are nearly parallel to this plane. Axial substituents sterically hinder 
each other if they emerge on the same side of the ring (e.g., 1,3-diaxial 
groups). In contrast, equatorial substituents are less crowded. The chair 
form of ^-D-glucopyranose predominates because all axial positions are occu- 
pied by hydrogen atoms. The bulkier -OH and -CH 2 OH groups emerge at 
the less-hindered periphery. The boat form of glucose is disfavored because 
it is quite sterically hindered. 


FIGURE 11.7 Chair and boat 
forms of j3-D-glucopyranose. 

The chair form is more stable 
because of less steric hindrance 
as the axial positions are occupied 
by hydrogen atoms. 


HO 




Furanose rings, like pyranose rings, are not planar. They can be puck- 
ered so that four atoms are nearly coplanar and the fifth is about 0.5 À away 
from this plane (Figure 11.8). This conformation is called an envelope form 
because the structure resembles an opened envelope with the back flap 
raised. In the ribose moiety of most biomolecules, either C-2 or C-3 is out 
of the plane on the same side as C-5. These conformations are called G 2 - 
endo and C 3 -endo, respectively. 


11.1.3 Monosaccharides Are Joined to Alcohols and Amines 
Through Glycosidic Bonds 

Monosaccharides can be modified by reaction with alcohols and amines to 
form adducts. For example, D-glucose will react with methanol in an acid- 
catalyzed process: the anomeric carbon atom reacts with the hydroxyl group 
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of methanol to form two products, methyl a-D-glucopyranoside and methyl 
jS-D-glucopyranoside. These two glucopyranosides differ in the configura- 
tion at the anomeric carbon atom. The new bond formed between the 
anomeric carbon atom of glucose and the hydroxyl oxygen atom of methanol 
is called a glycosidic bond —specifically, an O-glycosidic bond. The anomeric 
carbon atom of a sugar can be linked to the nitrogen atom of an amine to 
form an N-glycosidic bond. 

Indeed, we have previously encountered such reaction products; nu- 
cleosides are adducts between sugars such as ribose and amines such as ade- 
nine (Section 5.1.1). Some other important modified sugars are shown in 
Figure 11.9. Compounds such as methyl glucopyranoside show differences 
in reactivity from that of the parent monosaccharide. For example, un- 
modified glucose reacts with oxidizing agents such as cupric ion (Cu^^) be- 
cause the open-chain form has a free aldehyde group that is readily oxidized. 


V/H 


O, 




/OH 


CHoOH 


H OH 


H-C—OH 



H-C—OH 


Cu" 


Cu" 


CU2O 


HO- 


-H 


H 2 O, HO 


H-C—OH 

H-C—OH 

CH2OH 


Glycosides such as methyl glucopyranoside do not react, because they are 
not readily interconverted with a form that includes a free aldehyde group. 
Solutions of cupric ion (known as Fehling’s solution) provide a simple test 
for sugars such as glucose. Sugars that react are called reducing sugars; those 
that do not are called nonreducing sugars. 


O-Glycosidic bond 



Methyl a-D-glucopyranoside 


3 

H OH 

Methyl p-D-glucopyranoside 

/V-Glycosidic bond 



H OH 
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P-D-Acetylgalactosamine 

(GaiNAc) 


P-D-Acetylglucosamine 

(GlcNAc) 


Sialic add (Sia) 
(yV-Acetylneuraminate) 


H-C-OH 

H-C-OH 

CH2OH 


FIGURE 11.9 Modified monosaccharides. Carbohydrates can be modified by the 
addition of substituents (shown in red) other than hydroxyl groups. Such modified 
carbohydrates are often expressed on cell surfaces. 


11.2 COMPLEX CARBOHYDRATES ARE FORMED 
BY LINKAGE OF MONOSACCHARIDES 

Because sugars contain many hydroxyl groups, glycosidic bonds can join one 
monosaccharide to another. Oligosaccharides are built by the linkage of two 
or more monosaccharides by O-glycosidic bonds (Figure 11.10). In maltose, 
for example, two D-glucose residues are joined by a glycosidic linkage between 
the n-anomeric form of C-1 on one sugar and the hydroxyl oxygen atom on 
C-4 of the adjacent sugar. Such a linkage is called an n-1,4-glycosidic bond. 
The fact that monosaccharides have multiple hydroxyl groups means that 
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FIGURE 11.10 Maltose, a disaccharíde. 

Two molecules of glucose are linked by an 
üí-l, 4 -glycosidic bond to form the disaccha- 
ride maltose. 




various glycosidic linkages are possible. Indeed, the wide array of these link- 
ages in concert with the wide variety of monosaccharides and their many iso- 
meric forms makes complex carbohydrates information-rich molecules. 



(a-D-Glucopyranosyl-(1 ^2)-p-D-fructofuranose 



(P-D-Galactopyranosyl-(1 ^ 4)-a-D-glucopyranose 



(a-D-Glucopyranosyl-(1^4)-a-D-glucopyranose 

FIGURE 11.11 Common disaccharides. 

Sucrose, lactose, and maltose are common 
dietary components. 


11.2.1 Sucrose, Lactose, and Maltose Are the 
Common Disaccharides 

A disaccharíde consists of two sugars joined by an O-glycosidic bond. Three 
abundant disaccharides are sucrose, lactose, and maltose (Figure 11.11). 
Sucrose (common table sugar) is obtained commercially from cane or beet. 
The anomeric carbon atoms of a glucose unit and a fructose unit are joined 
in this disaccharide; the configuration of this glycosidic linkage is a for glu¬ 
cose and (3 for fructose. Sucrose can be cleaved into its component mono¬ 
saccharides by the enzyme sucrase. 

Lactose, the disaccharide of milk, consists of galactose joined to glucose 
by a j8-1,4-glycosidic linkage. Lactose is hydrolyzed to these monosaccha¬ 
rides by lactase in human beings (Section 16.1.12) and by jd-galactosidase in 
bactéria. In maltose, two glucose units are joined by an n -1,4 glycosidic link¬ 
age, as stated earlier. Maltose comes from the hydrolysis of starch and is in 
turn hydrolyzed to glucose by maltase. Sucrase, lactase, and maltase are 
located on the outer surfaces of epithelial cells lining the small intestine 
(Figure 11.12). 



FIGURE 11.12 Electron micrograph of a mícrovillus. Lactase and other enzymes that 
hydrolyze carbohydrates are present on microvilli that project from the outer face of the 
plasma membrane of intestinal epithelial cells. [From M. S. Mooseker and L. G. Tilney, 

J. Cell. Biol. 67(1975):725.] 


11.2.2 Glycogen and Starch Are Mobilizable Stores of Glucose 

Large polymeric oligosaccharides, formed by the linkage of multiple mono¬ 
saccharides, are called polysaccharides. Polysaccharides play vital roles in en- 
ergy storage and in maintaining the structural integrity of an organism. If all 
of the monosaccharides are the same, these polymers are called homopolymers. 
The most common homopolymer in animal cells is glycogen, the storage form 
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a-l,6-Glycosidic bond 


FIGURE 11.13 Branch point in glycogen. 

Two chains of glucose molecules joined by 
Q:-l,4-glycosidic bonds are linked by an 
Q:-l,6-glycosidic bond to create a branch 
point. Such an üí-l,6-glycosidic bond forms 
at approximately every 10 glucose units, 
making glycogen a highly branched molecule. 
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of glucose. As will be considered in detail in Chapter 21, glycogen is a very 
large, branched polymer of glucose residues. Most of the glucose units in glyco¬ 
gen are linked by a:-l,4-glycosidic bonds. The branches are formed by a:-l,6- 
glycosidic bonds, present about once in 10 units (Figure 11.13). 

The nutritional reservoir in plants is starch, of which there are two forms. 
Amylose, the unbranched type of starch, consists of glucose residues in 
n-1,4 linkage. Amylopectin, the branched form, has about 1 n-1,6 linkage 
per 30 o:-1,4 linkages, in similar fashion to glycogen except for its lower de- 
gree of branching. More than half the carbohydrate ingested by human 
beings is starch. Both amylopectin and amylose are rapidly hydrolyzed by 
a-amylase, an enzyme secreted by the salivary glands and the pancreas. 

11.2.3 Cellulose, the Major Structural Polymer of Plants, 

Consists of Linear Chains of Glucose Units 

Cellulose, the other major polysaccharide of glucose found in plants, serves a 
structural rather than a nutritional role. Cellulose is one of the most abundant 
organic compounds in the biosphere. Some 10^^ kg of cellulose is synthesized 
and degraded on Earth each year. It is an unbranched polymer of glucose 
residues joined by j8-1,4 linkages. The j8 configuration allows cellulose to form 
very long, straight chains. Fibrils are formed by parallel chains that interact 
with one another through hydrogen bonds. The o:-1,4 linkages in glycogen 
and starch produce a very different molecular architecture from that of cellu¬ 
lose. A hollow helix is formed instead of a straight chain (Figure 11.14). These 
differing consequences of the a and (3 linkages are biologically important. The 
straight chain formed by (3 linkages is optimal for the construction of fibers 
having a high tensile strength. In contrast, the open helix formed by a link¬ 
ages is well suited to forming an accessible store of sugar. Mammals lack cel- 
lulases and therefore cannot digest wood and vegetable fibers. 


HOH2C 


HOHnC 







HOH 2 C 


Cellulose 

((3-1,4 linkages) 



Starch and Glycogen 

(a-1,4 linkages) 


FIGURE 11.14 Glycosidk bonds 
determine polysaccharide structure. 

The /3-l,4 linkages favor straight chains, 
which are optimal for structural purposes. 
The «-1,4 linkages favor bent structures, 
which are more suitable for storage. 
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CHAPTER 11 • carbohydrates Made of Repeatíng Dísaccharíde Units 

A different kind of repeating polysaccharide is present on the animal cell 
surface and in the extracellular matrix. Many glycosaminoglycans are made 
of disaccharide repeating units containing a derivative of an amino sugar, ei- 
ther glucosamine or galactosamine (Figure 11.15). At least one of the sug¬ 
ar s in the repeating unit has a negatively charged carboxylate or sulfate group. 
Chondroitin sulfate, keratan sulfate, heparin, heparan sulfate, dermatan sul¬ 
fate, and hyaluronate are the major glycosaminoglycans. 

Glycosaminoglycans are usually attached to proteins to form proteo- 
glycans. Heparin is synthesized in a nonsulfated form, which is then deacet- 
ylated and sulfated. Incomplete modification leads to a mixture of vari- 
ously sulfated sequences. Some of them act as anticoagulants by binding 
specifically to antithrombin, which accelerates its sequestration of throm- 
bin (Section 10.7.8). Heparan sulfate is like heparin except that it has fewer 
N- and O-sulfate groups and more acetyl groups. 

Proteoglycans resemble polysaccharides more than proteins in as much 
as the carbohydrate makes up as much as 95% of the biomolecule by weight. 
Proteoglycans function as lubricants and structural components in connec- 
tive tissue, mediate adhesion of cells to the extracellular matrix, and bind 
factors that stimulate cell proliferation. 






Chondroitin 6-sulfate 

FIGURE 11.15 Repeating units in 
glycosaminoglycans. Structural formulas 
for five repeating units of important 
glycosaminoglycans illustrate the variety of 
modifications and linkages that are possible. 

Amino groups are shown in blue and 
negatively charged groups in red. Hydrogens 
have been omitted for clarity. The right-hand 
structure is glucosamine in each case. 


Keratan sulfate 


Heparin 



11.2.5 Specific Enzymes Are Responsible for Oligosaccharide 
Assembly 

Oligosaccharides are synthesized through the action of specific enzymes, 
glycosyltransferases, which catalyze the formation of glycosidic bonds. Each 
enzyme must be specific, to a greater or lesser extent, to the sugars being 
linked. Given the diversity of known glycosidic linkages, many different en¬ 
zymes are required. Note that this mode of assembly stands in contrast with 
those used for the other biological polymers heretofore discussed—that is, 
polypeptides and oligonucleotides. As these polymers are assembled, In¬ 
formation about monomer sequence is transferred from a template, and a 
single catalytic apparatus is responsible for all bond formation. 

The general form of the reaction catalyzed by a glycosyltransferase is 
shown in Figure 11.16. The sugar to be added comes in the form of an acti- 
vated sugar nucleotide. Sugar nucleotides are important intermediates in many 
processes, and we will encounter these intermediates again in Chapters 16 
and 21. Note that such reactions can proceed with either retention or 
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FIGURE 11.16 General form of a 
glycosyltransferase reactíon. The 

sugar to be added comes from a sugar 
nucleotide-in this case, UDP-glucose. 


inversion of configuration at the glycosidic carbon atom at which the new bond 
is formed; a given enzyme proceeds by one stereochemical path or the other. 

The human ABO blood groups illustrate the effects of glycosyl- 
transferases. Carbohydrates are attached to glycoproteins and glycol- 
ipids on the surfaces of red blood cells. For one type of blood group, one of 
the three different structures, termed A, B, and O, may be present (Figure 
11.17). These structures have in common an oligosaccharide foundation called 
the O (or sometimes H) antigen. The A and B antigens 
differ from the O antigen by the addition of one extra 
monosaccharide, either N-acetylgalactosamine (for A) or 
galactose (for B) through an n-1,3 linkage to a galactose 
moiety of the O antigen. 

Specific glycosyltransferases add the extra monosac¬ 
charide to the O antigen. Each person inherits the gene 
for one glycosyltransferase of this type from each parent. 

The type A transferase specifically adds N-acetylgalac- 
tosamine, whereas the type B transferase adds galactose. 

These enzymes are identical in all but 4 of 354 positions. 

The O phenotype is the result of a mutation that leads to 
premature termination of translation and, hence, to the 
production of no active glycosyltransferase. 

These structures have important implications for 
blood transfusions and other transplantation procedures. 

If an antigen not normally present in a person is intro- 
duced, the person s immune system recognizes it as for- 
eign. Adverse reactions can ensue, initiated by the in¬ 
travascular destruction of the incompatible red blood cells. 
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FIGURE 11.17 Structures of A, B, and O oligosaccharide 
antigens. Abbreviations: Fuc, fucose; Gal, galactose; GaiNAc, 
A/-acetylgalactosamine; GlcNAc, A/-acetylglu cosa mine. 


Y Why are different blood types present in the human population? 

Suppose that a pathogenic organism such as a parasite expresses on 
its cell surface a carbohydrate antigen similar to one of the blood-group 
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antigens. This antigen may not be readily detected as foreign in a person 
with the blood type that matches the parasite antigen, and the parasite will 
flourish. However, other people with different blood types will be protected. 
Hence, there will be selective pressure on human beings to vary blood type 
to prevent parasitic mimicry and a corresponding selective pressure on par¬ 
asites to enhance mimicry. The constant ‘‘arms race’' between pathogenic 
microorganisms and human beings drives the evolution of diversity of sur- 
face antigens within the human population. 


11.3 CARBOHYDRATES CAN BE ATTACHED TO PROTEINS 
TO FORM GLYCOPROTEINS 


FIGURE 11.18 Glycosídíc bonds 
between proteíns and carbohydrates. 

A glycosidic bond links a carbohydrate to 
the side chain of asparagine (A/-linked) or 
to the side chain of serine or threonine 
(0-linked). The glycosidic bonds are shown 
in red. 


Abbreviations for sugars— 


Fuc 

Fucose 

Gal 

Galactose 

GaiNAc 

A/-Acetylgalactosamine 

Glc 

Glucose 

GlcNAc 

A/-Acetylglu cosa mine 

Man 

Mannose 

Sia 

Sialic acid 

NeuNAc 

A/-Acety 1 n e u ra m i n a te 
(sialic acid) 


Carbohydrate groups are covalently attached to many different proteins to 
form glycoproteins. Carbohydrates are a much smaller percentage of the 
weight of glycoproteins than of proteoglycans. Many glycoproteins are com- 
ponents of cell membranes, where they play a variety of roles in processes 
such as cell adhesion and the binding of sperm to eggs. 

11.3.1 Carbohydrates May Be Linked to Proteins 
Through Asparagine (/V-Linked) or Through Serine 
or Threonine (0-Linked) Residues 

In glycoproteins, sugars are attached either to the amide nitrogen atom in 
the side chain of asparagine (termed an N-linkage) or to the oxygen atom in 
the side chain of serine or threonine (termed an O-linkage), as shown in 
Figure 11.18. An asparagine residue can accept an oligosaccharide only if 
the residue is part of an Asn-X-Ser or Asn-X-Thr sequence, in which X can 
be any residue. Thus, potential glycosylation sites can be detected within amino 
acid sequences. However, which of these potential sites is actually glyco- 
sylated depends on other aspects of the protein structure and on the cell type 
in which the protein is expressed. All N-linked oligosaccharides have in 
common a pentasaccharide core consisting of three mannose and two N- 
acetylglucosamine residues. Additional sugars are attached to this core to 
form the great variety of oligosaccharide patterns found in glycoproteins 
(Figure 11.19). 
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FIGURE 11.19 Af-linked oligosaccharides. A pentasaccharide core (shaded yellow) is 
common to all A/-linked oligosaccharides and serves as the foundation for a wide variety of 
A/-linked oligosaccharides, two of which are illustrated: (A) high-mannose type; (B) complex 
type. Detailed Chemical formulas and schematic structures are shown for each type. 
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FIGURE 11.20 Elastase, a 
secreted glycoproteín, showíng linked 
carbohydrates on its surface. Elastase 
is a protease found in serum. Note that 
the oligosaccharide chains have substantial 
size even for this protein, which has a 
relatively low levei of glycosylation. 


Carbohydrates are linked to some soluble proteins as well as membrane 
proteins. In particular, many of the proteins secreted from cells are glyco- 
sylated. Most proteins present in the serum component of blood are glyco¬ 
proteins (Figure 11.20). Furthermore, N-acetylglucosamine residues are 
O-linked to some intracellular proteins. The role of these carbohydrates, 
which are dynamically added and removed, is under active investigation. 

11.3.2 Protein Glycosylation Takes Place in the Lumen of the 
Endoplasmic Reticuium and in the Golgi Complex 

Protein glycosylation takes place inside the lumen of the endoplasmic retic¬ 
uium (ER) and the Golgi complex, organelles that play central roles in pro¬ 
tein trafficking (Figure 11.21). One such glycoprotein (depicted in Figure 
11.20) is the proteolytic enzyme elastase (Section 9.1.4), which is secreted 
by the pancreas as a zymogen (Section 10.7). This protein is synthesized by 
ribosomes attached to the cytoplasmic face of the ER membrane, and the 
peptide chain is inserted into the lumen of the ER as it grows, guided by a 
signal sequence of 29 amino acids at the amino terminus. This signal se- 
quence, which directs the protein through a channel in the ER membrane, 
is cleaved from the protein in the transport process into the ER (Figure 
11.22). After the protein has entered the ER, the glycosylation process be- 
gins. The N-linked glycosylation begins in the ER and continues in the 
Golgi complex, whereas the O-linked glycosylation takes place exclusively 
in the Golgi complex. 



FIGURE 11.21 Golgi complex and 
endoplasmic reticuium. The electron 
micrograph shows the Golgi complex and 
adjacent endoplasmic reticuium. The black 
dots on the cytoplasmic surface of the 
ER membrane are ribosomes. [Micrograph 
courtesy of Lynne Mercer.] 



FIGURE 11.22 Transport into the 
endoplasmic reticuium. As translation 
takes place, a signal sequence on 
membrane and secretory proteins directs 
the nascent protein through channels in 
the ER membrane and into the lumen. 

In most cases, the signal sequence is 
subsequentiy cleaved and degraded. 
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Isoprene 


11.3.3 Af-Linked Glycoproteins Acquire Their Initial Sugars 
from Dolichol Donors in the Endoplasmic Reticuium 

A large oligosaccharide destined for attachment to the asparagine residue of 
a protein is assembled attached to dolichol phosphate, a specialized lipid mol- 
ecule containing as many as 20 isoprene (C 5 ) units (Section 26.x.x). 


2- 

n = 15-19 

Dolichol phosphate 

The terminal phosphate group is the site of attachment of the activated 
oligosaccharide, which is subsequently transferred to the protein acceptor. 
Dolichol phosphate resides in the ER membrane with its phosphate termi- 
nus on the cytoplasmic face. 

The assembly process proceeds in three stages. First, 2 iV-acetylglu- 
cosamine residues and 5 mannose residues are added to the dolichol phos¬ 
phate through the action of a number of cytoplasmic enzymes that catalyze 
monosaccharide transfer from sugar nucleotides. Then, in a remarkable 
(and, as yet, not well understood) process, this large structure is ‘Tipped’' 
through the ER membrane into the lumen of the ER. Finally, additional 
sugars are added by enzymes in the ER lumen, this time with the use of 
monosaccharides activated by attachment to dolichol phosphate. This 
process ends with the formation of a 14-residue oligosaccharide attached to 
dolichol phosphate (Figure 11.23). 

The 14-sugar-residue precursor attached to this dolichol phosphate in- 
termediate is then transferred en bloc to a specific asparagine residue of the 
growing polypeptide chain. In regard to elastase, oligosaccharides are linked 




\JDP-Çy 
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Stage 1 


-Y- 

Stage 2 


Stage 3 


FIGURE 11.23 Assembly of an yv-linked oligosaccharide precursor on dolichol 
phosphate. The first stage of oligosaccharide synthesis takes place in the cytoplasm 
on the exposed phosphate of a membrane-embedded dolichol molecule. Synthesis of the 
precursor is completed in the lumen of the ER after flipping of the dolichol phosphate 
and attached oligosaccharide. 
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to the asparagine residues in the recognition sequences Asn 109-Gly-Ser 
and Asn 159-Val-Thr. Both the activated sugars and the complex enzyme 
that is responsible for transferring the oligosaccharide to the protein are lo- 
cated on the lumenal side of the ER, accounting for the fact that proteins in 
the cytosol are not glycosylated by this pathway. Before the glycoprotein 
leaves the lumen of the ER, 3 glucose molecules are removed from the 14- 
residue oligosaccharide. As we will see in Section 11.3.6, the sequential re¬ 
mo vai of these glucose molecules is a quality-control step that ensures that 
only properly folded glycoproteins are further processed. 

T Dolichol pyrophosphate released in the transfer of the oligosaccharide 
to the protein is recycled to dolichol phosphate by the action of a phos- 
phatase. This hydrolysis is blocked by bacitracin, an antibiotic. Another in- 
teresting antibiotic inhibitor of N-glycosylation is tunicamycin, a hydropho- 
bic analog of the sugar nucleotide uridine diphosphate-N-acetylglucosamine 
(UDP-GlcNAc), the activated form of N-acetylglucosamine used as a sub- 
strate for the enzymes that synthesize the oligosaccharide unit on dolichol 
phosphate. Tunicamycin blocks the addition of N-acetylglucosamine to 
dolichol phosphate, the first step in the formation of the core oligosaccharide. 

11.3.4 Transport Vesicles Carry Proteins from the Endoplasmic 
Reticuium to the Golgi Complex for Further Glycosylation 
and Sorting 

Proteins in the lumen of the ER and in the ER membrane are transported to 
the Golgi complex, which is a stack of flattened membranous sacs. The Golgi 
complex has two principal roles. First, carbohydrate units of glycoproteins are 
altered and elaborated in the Golgi complex. The 0-linked sugar units are fash- 
ioned there, and the N-linked sugars, arriving from the ER as a component 
of a glycoprotein, are modified in many different ways. Second, the Golgi 
complex is the major sorting center of the cell. Proteins proceed from the Golgi 
complex to lysosomes, secretory granules (as is the case for the elastase zy- 
mogen), or the plasma membrane, according to signals encoded within their 
amino acid sequences and three-dimensional structures (Figure 11.24). 


—I 309 I - 

Glycoproteins 




FIGURE 11.24 Golgi complex as 
sorting center. The Golgi complex is the 
sorting center in the targeting of proteins 
to lysosomes, secretory vesicles, and the 
plasma membrane. The cis face of the 
Golgi complex receives vesicles from the 
ER, and the trans face sends a different 
set of vesicles to target sites. Vesicles aiso 
transfer proteins from one compartment 
of the Golgi complex to another. [Courtesy 
of Dr. Marilyn Farquhar.] 
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FIGURE 11.25 Formatíon of a 
mannose 6-phosphate marker. A 

glycoprotein destined for delivery to 
lysosomes acquires a phosphate marker 
in the cis Golgi compartment in a two-step 
process. First, a phosphotransferase adds a 
phospho-A/-acetylglucosamine unit to the 
6-OH group of a mannose, and then a 
phosphodiesterase removes the added 
sugar to generate a mannose 6-phosphate 
residue in the core oligosaccharide. 


The Golgi complex of a typical mammalian cell has 3 or 4 membranous 
sacs (cisternae), and those of many plant cells have about 20. The Golgi 
complex is differentiated into (1) a cis compartment, the receiving end, 
which is closest to the ER; (2) medial compartments; and (3) a trans 
compartment, which exports proteins to a variety of destinations. These 
compartments contain different enzymes and mediate distinctive functions. 

The N-linked carbohydrate units of glycoproteins are further modified 
in each of the compartments of the Golgi complex. In the cis Golgi com¬ 
partment, three mannose residues are removed from the oligosaccharide 
chains of proteins destined for secretion or for insertion in the plasma 
membrane. The carbohydrate units of glycoproteins targeted to the lumen 
of lysosomes are further modified. In the medial Golgi compartments 
of some cells, two more mannose residues are removed, and two N- 
acetylglucosamines residues and a fucose residue are added. Finally, in the trans 
Golgi, another N-acetylglucosamine residue can be added, followed by 
galactose and sialic acid, to form a complex oligosaccharide unit. The se- 
quence of N-linked oligosaccharide units of a glycoprotein is determined 
both by (1) the sequence and conformation of the protein undergoing gly- 
cosylation and by (2) the glycosyltransferases present in the Golgi com¬ 
partment in which they are processed. Note that, despite all of this Proc¬ 
essing, N-glycosylated proteins have in common a pentasaccharide core (see 
Figure 11.19). Carbohydrate processing in the Golgi complex is called term¬ 
inal glycosylation to distinguish it from core glycosylation, which takes place 
in the ER. Tremendous structural diversification can occur as a result of the 
terminal glycosylation process. 

11.3.5 Mannose 6-phosphate Targets Lysosomal Enzymes 
to Their Destinations 

7 A carbohydrate marker directs certain proteins from the Golgi com¬ 
plex to lysosomes. A clue to the identity of this marker carne from 
analyses of 1-cell disease (also called mucolipidosis II), a lysosomal storage dis- 
ease. Lysosomes are organelles that degrade and recycle damaged cellular com- 
ponents or material brought into the cell by endocytosis. Patients with I-cell 
disease suffer severe psychomotor retardation and skeletal deformities. Their 
lysosomes contain large inclusions of undigested glycosaminoglycans (Section 
11.2.4) and glycolipids (Section 12.2.3)—hence the ‘‘P' in the name of the dis¬ 
ease. These inclusions are present because at least eight acid hydrolases re- 
quired for their degradation are missing from affected lysosomes. In contrast, 
very high leveis of the enzymes are present in the blood and urine. Thus, ac¬ 
tive enzymes are synthesized, but they are exported instead of being se- 
questered in lysosomes. In other words, a whole series of enzymes is mislocated 
in I-cell disease. Normally, these enzymes contain a mannose 6-phosphate 
residue, but, in I-cell disease, the attached mannose is unmodified (Figure 
11.25). Mannose 6-phosphate is infact the marker that normally directs many 
hydrolytic enzymes from the Golgi complex to lysosomes. I-cell patients are de- 
ficient in the phosphotransferase catalyzing the first step in the addition of the 
phosphoryl group; the consequence is the mistargeting of eight essential enzymes. 

11.3.6 Glucose Residues Are Added and Trimmed to Aid 
in Protein Folding 

The oli gosaccharide precursors added to proteins may play a role in protein 
(folding) as well as in protein targeting. As we have seen, before a glycoprotein 
leaves the ER, two glucosidases cleave the three glucose residues of the 
oligosaccharide in a step-by-step fashion. If the protein is properly folded, it 
moves to the Golgi complex for further processing (Section 11.3.3). However, 
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FIGURE 11.26 Quality-control system 
for protein foldíng in the ER. A properly 
folded glycoprotein wili move to the 
Golgi complex after the removal of 
glucose moieties (shown in red). An 
unfolded or misfolded protein wilI receive 
a glucose residue, through the action of 
a glucosyltransferase. Such glucosylated 
glycoproteins bind to calnexin (or the 
related protein calreticulin), which serves 
as a chaperone to allow multiple attempts 
to attain correct folding. Properly folded 
proteins are not reglucosylated. 


if the protein is sufficiently unfolded that the oligosaccharide can act as a 
substrate for glucosyltransferase, another enzyme residing in the lumen of 
the ER, a glucose residue wili be reattached (Figure 11.26). This residue, in 
turn, is bound by one of two chaperone proteins called calnexin and calretic¬ 
ulin. Calnexin, the more fully understood of the two proteins, is membrane 
bound, whereas calreticulin is a soluble component of the ER lumen. Un¬ 
folded proteins held by these carbohydrate-binding proteins (lectins, 
Section 11.4) cannot leave the ER, giving the unfolded proteins time to fold 
properly. When a chaperone releases the bound protein, the glucose residue 
wili be cleaved by a glucosidase. If the folding is correct, the protein moves 
to the Golgi complex. Otherwise, the protein wili repeat another cycle of 
glucose addition and binding until the glucose-free (and, hence, properly 
folded) protein can be translocated to the G olgi complex. This quality- 
contr ol System reveals an important pnnciple: {carbohydrates carry informa-) 
(tiQn.) Here, the availability of carbohydrates to specific glycosyltransferases 
conveys Information about the folding state of the protein. Moreover, we 
see the reiteration of a theme in the control of protein folding: other chap¬ 
erone proteins rely on the same essential mechanism of allowing misfolded 
proteins multiple attempts to reach a folded state (Section 3.6), even though 
carbohydrate modification is not a part of their reaction cycles. 

11.3.7 Oligosaccharides Can Be "Sequenced'' 

Given the large diversity of oligosaccharide structures and the many possi- 
ble points of attachment to most proteins, how is it possible to determine the 
structure of a glycoprotein? Most approaches are based on the use of enzymes 
that cleave oligosaccharides at specific types of linkages. For example, N- 
linked oligosaccharides can be released from proteins by an enzyme such as 
Peptide N-glycosidase F, which cleaves the iV-glycosidic bonds linking the 
oligosaccharide to the protein. The oligosaccharides can then be isolated and 
analyzed. Through the use of MALDI-TOF or other mass spectrometric 
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FIGURE 11.27 Mass spectrometríc 
"sequencing" of oligosaccharídes. 

Carbohydrate-cleaving enzymes were 
used to release and specifically cleave 
the oligosaccharide component of the 
glycoprotein fetuin from bovine serum. 

Parts A and B show the masses obtained 
with MALDI-TOF spectrometry as well as 
the corresponding structures of the 
oligosaccharide digestion products (using 
the same scheme as that in Figure 11.19): 
(A) digestion with Peptide A/-glycosidase F 
(to release the oligosaccharide from the 
protein) and neuraminidase; (B) digestion 
with Peptide A/-glycosidase F, neuraminidase, 
and jS-l,4-galactosidase. Knowledge of the 
enzyme specificities and the masses of the 
products permits the characterization of the 
oligosaccharide. [After A. Varki, R. Cummings, 

J. Esko, H. Freeze, G. Hart, and J. Marth (Eds.), 
Essentials of Glycobiology (Cold Spring Harbor 
Laboratory Press, 1999), p. 596.] 





techniques (Section 4.1.7), the mass of an oligosaccharide fragment can be 
determined. However, given the large number of potential monosaccharide 
combinations, many possible oligosaccharide structures are consistent with 
a given mass. More complete Information can be obtained by cleaving the 
oligosaccharide with enzymes of varying specificities. For example, j8-l,4- 
galactosidase cleaves j8-glycosidic bonds exclusively at galactose residues. 
The products can again be analyzed by mass spectrometry (Figure 11.27). 
The repetition of this process with the use of an array of enzymes of differ- 
ent specificity will eventually reveal the structure of the oligosaccharide. 

The points of oligosaccharide attachment can be determined through the 
use of proteases. Cleavage of a protein by applying specific proteases yields 
a characteristic pattern of peptide fragments that can be analyzed chro- 
matographically (Section 4.2.1). The chromatographic properties of pep- 
tides attached to oligosaccharides will change on glycosidase treatment. 
Mass spectrometric analysis or direct peptide sequencing can reveal the 
identity of the peptide in question and, with additional effort, the exact site 
of oligosaccharide attachment. 

Posttranslational modifications such as glycosylation greatly increase the 
complexity of the proteome. A given protein with several potential glycosi- 
dation sites can have many different glycosylated forms (sometimes called 
glycoforms), each of which may be generated only in a specific cell type or 
developmental stage. Now that the sequencing of the human genome is es- 
sentially complete, the characterization of the much more complex pro¬ 
teome, including the biological roles of specifically modified proteins, can 
begin in earnest. 



11.4 LECTINS ARE SPECIFIC 
CARBOHYDRATE-BINDING PROTEINS 

The diverse carbohydrate structures displayed on cell surfaces are well 
suited to serve as interaction sites between cells and their environments. 
Proteins termed lectins (from the Latin legere, ‘‘to select’') are the partners 
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that bind specific carbohydrate structures. Lectins are ubiquitous, being | 313 | 

found in animais, plants, and microorganisms. We have already seen that Lectins 

some lectins, such as calnexin, function as chaperones in protein folding 
(Section 11.3.6). 

11.4.1 Lectins Promote Interactions Between Cells 

The chief function of lectins in animais is to facilitate cell-cell contact. A 
lectin usually contains two or more binding sites for carbohydrate units; 
some lectins form oligomeric structures with multiple binding sites. The 
binding sites of lectins on the surface of one cell interact with arrays of car- 
bohydrates displayed on the surface of another cell. Lectins and carbohy- 
drates are linked by a number of relatively weak interactions that ensure 
specificity yet permit unlinking as needed. The interactions between one 
cell surface with carbohydrates and another with lectins resemble the action 
of Velcro; each interaction is relatively weak but the composite is strong. 

The exact role of lectins in plants is unclear, although they can serve as 
potent insecticides. Castor beans contain so much lectin that they are toxic 
to most organisms. The binding specificities of lectins from plants have been 
well characterized (Figure 11.28). Bactéria, too, contain lectins. Escherichia 
coli bactéria are able to adhere to epithelial cells of the gastrointestinal tract 
because lectins on the E. coli surface recognize oligosaccharide units on the 
surfaces of target cells. These lectins are located on slender hairlike ap- 
pendages cslled fimbriae (pili). 



GlcNAc 

lyS-1,4 

GlcNAc Gal 

lyS-1,4 |y8-l,3 

GlcNAc GaINAc 

Binds to wheat germ agglutinin Binds to peanut lectin 


Gal Gal 

y3-l,4| l/J-1,4 

GlcNAc GlcNAc 

Binds to phytohemagglutinin 


FIGURE 11.28 Binding selectivities of plant lectins. The plant lectins wheat germ 
agglutinin, peanut lectin, and phytohemagglutinin recognize different oligosaccharides. 



Lectins can be divided into classes on the basis of their amino acid se- 
quences and biochemical properties. One large class is the C type (for calcium- 
requiring) found in animais. These proteins have in common a domain of 120 
amino acids that is responsible for carbohydrate binding. The structure of one 
such domain bound to a carbohydrate target is shown in Figure 11.29. 



FIGURE 11.29 Structure of a C-type carbohydrate-binding domain from an 
animal lectin. A calcium ion links a mannose residue to the lectin. Selected interactions 
are shown, with some hydrogen atoms omitted for clarity. 
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FIGURE 11.30 Selectins medíate 
cell-cell ínteractions. The scanning 
electron micrograph shows lymphocytes 
adhering to the endothelial lining of a 
lymph node. The L selectins on the 
lymphocyte surface bind specifically to 
carbohydrates on the lining of the lymph- 
node vessels. [Courtesy of Dr. Eugene Butcher.] 


A calcium ion acts as a bridge between the protein and the sugar through 
direct Ínteractions with sugar hydroxyl groups. In addition, two glutamate 
residues in the protein bind to both the calcium ion and the sugar, while 
other protein side chains form hydrogen bonds with other hydroxyl groups 
on the carbohydrate. Changes in the amino acid residues that interact 
with the carbohydrate alter the carbohydrate-binding specificity of the 
lectin. 


T Proteins termed selectins are members of the C-type family. Selectins 
bind immune-System cells to the sites of injury in the inflammatory 
response (Figure 11.30). The L, E, and P forms of selectins bind specifi¬ 
cally to carbohydrates on /ymph-node vessels, endothelium, or activated 
blood platelets, respectively. New therapeutic agents that control inflam- 
mation may emerge from a deeper understanding of how selectins bind and 
distinguish different carbohydrates. 

11.4.2 Influenza Virus Binds to Sialic Acid Residues 

7 The ability of viruses to infect specific cell types is dictated in part 
by the ability of these viruses to bind to particular structures or re- 
ceptors on the surfaces of cells. In some cases, these receptors are carbohy¬ 
drates. For example, influenza virus recognizes sialic acid residues present 
on cell-surface glycoproteins. The virai protein that binds to these sugars is 
called hemagglutinin (Figure 11.31). 

After these surface ínteractions have taken place and the virus has been 
taken into the cell, another virai protein, neuramidase, cleaves the glyco- 
sidic bonds to the sialic acid residues, freeing the virus to infect the cell. 
Inhibitors of this enzyme are showing some promise as anti-influenza 
agents. 




FIGURE 11.31 Structure of a part of influenza hemagglutinin. This virai 
protein has multiple binding sites for linking to sialic acid residues on the target-cell surface. 
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SUMMARY 


Monosaccharides Are Aldehydes or Ketones with Multiple 
Hydroxyl Groups 

An aldose is a carbohydrate with an aldehyde group (as in glyceralde- 
hyde and glucose), whereas a ketose contains a keto group (as in dihy- 
droxyacetone and fructose). A sugar belongs to the D series if the ab- 
solute configuration of its asymmetric carbon farthest from the aldehyde 
or keto group is the same as that of D-glyceraldehyde. Most naturally 
occurring sugars belong to the D series. The C-1 aldehyde in the open- 
chain form of glucose reacts with the C-5 hydroxyl group to form a six- 
membered pyranose ring. The C-2 keto group in the open-chain form 
of fructose reacts with the C-5 hydroxyl group to form a five-membered 
furanose ring. Pentoses such as ribose and deoxyribose also form fura- 
nose rings. An additional asymmetric center is formed at the anomeric 
carbon atom (C-1 in aldoses and C-2 in ketoses) in these cyclizations. 
The hydroxyl group attached to the anomeric carbon atom is below the 
plane of the ring (viewed in the standard orientation) in the a anomer, 
whereas it is abo ve the ring in the j8 anomer. Not all the atoms in the 
rings lie in the same plane. Rather, pyranose rings usually adopt the 
chair conformation, and furanose rings usually adopt the envelope con- 
formation. Sugars are joined to alcohols and amines by glycosidic 
bonds from the anomeric carbon atom. For example, iV-glycosidic 
bonds link sugars to purines and pyrimidines in nucleotides, RNA, 
and DNA. 

Complex Carbohydrates Are Formed by Linkage of Monosaccharides 

Sugars are linked to one another in disaccharides and polysaccharides 
by 0-glycosidic bonds. Sucrose, lactose, and maltose are the common 
disaccharides. Sucrose (common table sugar), obtained from cane or beet, 
consists of n-glucose and j8-fructose joined by a glycosidic linkage be- 
tween their anomeric carbon atoms. Lactose (in milk) consists of galac- 
tose joined to glucose by a j8-l,4 linkage. Maltose (from starch) consists 
of two glucoses joined by an n-1,4 linkage. Starch is a polymeric form 
of glucose in plants, and glycogen serves a similar role in animais. Most 
of the glucose units in starch and glycogen are in a:-l,4 linkage. Glyco¬ 
gen has more branch points formed by n-1,6 linkages than does starch, 
which makes glycogen more soluble. Cellulose, the major structural 
polymer of plant cell walls, consists of glucose units joined by j8-l,4 
linkages. These j8 linkages give rise to long straight chains that form fi- 
brils with high tensile strength. In contrast, the a linkages in starch and 
glycogen lead to open helices, in keeping with their roles as mobilizable 
energy Stores. Cell surfaces and extracellular matrices of animais con- 
tain polymers of repeating disaccharides called glycosaminoglycans. 
One of the units in each repeat is a derivative of glucosamine or galac- 
tosamine. These highly anionic carbohydrates have a high density of 
carboxylate or sulfate groups. Proteins bearing covalently linked gly¬ 
cosaminoglycans are termed proteoglycans. 

Carbohydrates Can Attach to Proteins to Form Glycoproteins 

Specific enzymes link the oligosaccharide units on proteins either to the 
side-chain oxygen atom of a serine or threonine residue or to the side- 
chain amide nitrogen atom of an asparagine residue. Protein glycosyla- 
tion takes place in the lumen of the endoplasmic reticulum. The N- 
linked oligosaccharides are synthesized on dolichol phosphate and 
subsequently transferred to the protein acceptor. Additional sugars are 
attached in the Golgi complex to form diverse patterns. 
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-1 316 I - • Lectins Are Specific Carbohydrate-Binding Proteins 

CHAPTER 11 • Carbohydrates Carbohydrates are recognized by proteins called lectins, which are 

found in animais, plants, and microorganisms. In animais, the interplay 
of lectins and their sugar targets guides cell-cell contact. The virai pro- 
tein hemagglutinin on the surface of the influenza virus recognizes sialic 
acid residues on the surfaces of the cells invaded by the virus. A small 
number of carbohydrate residues can be joined in many different ways 
to form highly diverse patterns that can be distinguished by the lectin 
domains of protein receptors. 



-1 KEY TERMS 

monosaccharide (p. 296) 
triose (p. 296) 
ketose (p. 296) 
aldose (p. 296) 
enantiomer (p. 296) 
tetrose (p. 296) 
pentose (p. 296) 
hexose (p. 296) 
heptose (p. 296) 
diastereoisomer (p. 296) 
epimer (p. 297) 
hemiacetal (p. 298) 
pyranose (p. 298) 


hemiketal (p. 298) 
furanose (p. 299) 
anomer (p. 299) 
glycosidic bond (p. 301) 
reducing sugar (p. 301) 
nonreducing sugar (p. 301) 
oligosaccharide (p. 301) 
disaccharide (p. 302) 
polysaccharide (p. 302) 
glycogen (p. 302) 
starch (p. 303) 
cellulose (p. 303) 
glycosaminoglycan (p. 304) 


proteoglycan (p. 304) 
glycosyltransferase (p. 304) 
glycoprotein (p. 306) 
endoplasmic reticulum (p. 307) 
Golgi complex (p. 307) 
dolichol phosphate (p. 308) 
lectin (p. 312) 
selectin (p. 314) 
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-1 PROBLEMS I- 

1. Word origin. Account for the origin of the term carbohydrate. 

2. Diversity. How many different oligosaccharides can be made 
by linking one glucose, one mannose, and one galactose? Assume 
that each sugar is in its pyranose form. Compare this number 
with the number of tripeptides that can be made from three dif¬ 
ferent amino acids. 

3. Couples. Indicate whether each of the foliowing pairs of sug- 
ars consists of anomers, epimers, or an aldose-ketose pair: 

(a) D-glyceraldehyde and dihydroxyacetone 

(b) D-glucose and D-mannose 

(c) D-glucose and D-fruetose 

(d) a-D-glucose and jS-D-glucose 

(e) D-ribose and D-ribulose 

(f) D-galactose and D-glucose 

4. Tollen’s test. Glucose and other aldoses are oxidized by an 
aqueous solution of a silver-ammonia complex. What are the re- 
action Products? 

5. Mutarotation. The specific rotations of the a and j8 anomers 
of D-glucose are +112 degrees and +18.7 degrees, respectively. 
Specific rotation, [njo, is defined as the observed rotation of 
light of wavelength 589 nm (the D line of a sodium lamp) pass- 
ing through 10 cm of a 1 g mP^ solution of a sample. When a 
crystalline sample of n-D-glucopyranose is dissolved in water, 
the specific rotation decreases from 112 degrees to an equilib- 
rium value of 52.7 degrees. On the basis of this result, what are 
the proportions of the a and j8 anomers at equilibrium? Assume 
that the concentration of the open-chain form is negligible. 

6. Telltale adduet. Glucose reacts slowly with hemoglobin and 
other proteins to form covalent compounds. Why is glucose re- 
active? What is the nature of the adduet formed? 

7. Periodate cleavage. Compounds containing hydroxyl 
groups on adjacent carbon atoms undergo carbon-carbon 
bond cleavage when treated with periodate ion (I04~). How 
can this reaction be used to distinguish between pyranosides 
and furanosides? 

8. Oxygen source. Does the oxygen atom attached to C-1 in 
methyl n-D-glucopyranoside come from glucose or methanol? 


9. Sugar lineup. Identify the following four sugars. 





10. Cellular glue. A trisaccharide unit of a cell-surface glyco- 
protein is postulated to play a criticai role in mediating cell-cell 
adhesion in a particular tissue. Design a simple experiment to 
test this hypothesis. 

11. Mapping the molecule. Each of the hydroxyl groups of glucose 
can be methylated with reagents such as dimethylsulfate under Ba¬ 
sic conditions. Explain how exhaustive methylation followed by 
compete digestion of a known amount of glycogen would enable 
you to determine the number of branch points and reducing ends. 

12. Component parts. Raffinose is a trisaccharide and a minor 
constituent in sugar beets. 

(a) Is raffinose a reducing sugar? Explain. 

(b) What are the monosaccharides that compose raffinose? 

(c) j8-Galactosidase is an enzyme that will remove galactose 
residues from an oligosaccharide. What are the produets of 
j8-galactosidase treatment of raffinose? 



Raffinose 
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13. Anomeric differences. a-D-Mannose is a sweet-tasting sugar. 
j8-D-Mannose, on the other hand, tastes bitter. A pure solution 
of a-D-mannose loses its sweet taste with time as it is converted 
into the j8 anomer. Draw the j8 anomer and explain how it is 
formed from the a anomer. 



14. A taste of honey. Fructose in its jS-D-pyranose form accounts 
for the powerful sweetness of honey. The jS-D-furanose form, al- 
though sweet, is not as sweet as the pyranose form. The furanose 
form is the more stable form. Draw the two forms and explain 
why it may not always be wise to cook with honey. 

15. Making ends meet. (a) Compare the number of reducing ends 
to nonreducing ends in a molecule of glycogen. (b) As we will 


see in Chapter 21, glycogen is an important fuel storage form 
that is rapidly mobilized. At which end—the reducing or nonre¬ 
ducing—would you expect most metabolism to take place? 

16. Carbohydrates and proteomics. Suppose that a protein con- 
tains six potential N-linked glycosylation sites. How many pos- 
sible proteins can be generated, depending on which of these 
sites is actually glycosylated? Do not include the effects of di- 
versity within the carbohydrate added. 

Chapter Integration Problem 

17. Stereospecificity Sucrose, a major product of photosynthesis 
in green leaves, is synthesized by a battery of enzymes. The sub- 
strates for sucrose synthesis, D-glucose and D-fructose, are a 
mixture of a and j8 anomers as well as acyclic compounds in so¬ 
lution. Nonetheless, sucrose consists of a-D-glucose linked by 
its carbon-1 atom to the carbon-2 atom of j8-D-fructose. How 
can the specificity of sucrose be explained in light of the poten¬ 
tial substrates? 
















Lipids and Cell Membranes 
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The surface of a soap bubble is a bilayer formed by detergent molecules. The polar 
heads (red) pack together leaving the hydrophobic groups (green) in contact with air on 
the inside and outside of the bubble. Other bilayer structures define the boundary of a cell. 
[(Left) Photonica.] 


The boundaries of cells are formed by biological membranes, 
define the inside and the outside of a cell (Figure 12.1). These 
molecules generated inside the cell from leaking out and un- 
wanted molecules from diffusing in; yet they also contain 
transport systems that allow specific molecules to be taken 
up and unwanted compounds to be removed from the cell. 
Such transport systems confer on membranes the important 
property of selective permeability. 

Membranes are dynamic structures in which proteins 
float in a sea of lipids. The lipid components of the mem- 
brane form the permeability barrier, and protein compo¬ 
nents act as a transport system of pumps and channels that 
endow the membrane with selective permeability. 

In addition to an externai cell membrane (called the 
plasma membrane), eukaryotic cells also contain internai 
membranes that form the boundaries of organelles such as 
mitochondria, chloroplasts, peroxisomes, and lysosomes. 
Functional specialization in the course of evolution has 
been closely linked to the formation of such compartments. 
Specific Systems have evolved to allow targeting of selected 
proteins into or through particular internai membranes and, 
hence, into specific organelles. Externai and internai mem¬ 
branes have essential features in common, and these essen- 
tial features are the subject of this chapter. 
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FIGURE 12.1 Red-blood-cell plasma 
membrane. An electron micrograph of a 
preparation of plasma membranes from 
red blood cells showing the membranes 
as seen "on edge," in cross section. 
[Courtesy of Dr. Vincent Marchesi.] 


Biological membranes serve several additional important functions in- 
dispensable for life, such as energy storage and information transduction, 
that are dictated by the proteins associated with them. In this chapter, we 
will examine the general properties of membrane proteins—how they can 
exist in the hydrophobic environment of the membrane while connecting 
two hydrophilic environments—and delay a discussion of the functions of 
these proteins to the next and later chapters. 


12.1 MANY COMIVION FEATURES UNDERLIE THE 
DIVERSITY OF BIOLOGICAL MEMBRANES 

Membranes are as diverse in structure as they are in function. However, 
they do have in common a number of important attributes: 

1 . Membranes are sheetlike structures, only two molecules thick, that form 
closed boundaries between different compartments. The thickness of most 
membranes is between 60 A (6 nm) and 100 À (10 nm). 

2. Membranes consist mainly of lipids and proteins. Their mass ratio ranges 
from 1:4 to 4:1. Membranes also contain carbohydrates that are linked to 
lipids and proteins. 

3. Membrane lipids are relatively small molecules that have both hydrophilic 
and hydrophobic moieties. These lipids spontaneously form closed bimolecu- 
lar sheets in aqueous media. These lipid bilayers are barriers to the flow of 
polar molecules. 

4. Specific proteins mediate distinctive functions of membranes. Proteins serve 
as pumps, channels, receptors, energy transducers, and enzymes. Mem¬ 
brane proteins are embedded in lipid bilayers, which create suitable envi¬ 
ronments for their action. 

5. Membranes are noncovalent assemblies. The constituent protein and lipid 
molecules are held together by many noncovalent interactions, which are 
cooperative. 

6 . Membranes are asymmetric. The two faces of biological membranes al- 
ways differ from each other. 

7. Membranes dive fluid structures. Lipid molecules diffuse rapidly in the 
plane of the membrane, as do proteins, unless they are anchored by specific 
interactions. In contrast, lipid molecules and proteins do not readily rotate 
across the membrane. Membranes can be regarded as two-dimensional Solu¬ 
tions of oriented proteins and lipids. 

8 . Most cell membranes are electrically polarized, such that the inside is 
negative [typically — 60 millivolts (mV)]. Membrane potential plays a key 
role in transport, energy conversion, and excitability (Chapter 13). 


12.2 FATTY ACIDS ARE KEY CONSTITUENTS OF LIPIDS 

Among the most biologically significant properties of lipids are their hy¬ 
drophobic properties. These properties are mainly due to a particular com- 
ponent of lipids: fatty acids, or simply fats. Fatty acids also play important 
roles in signal-transduction pathways (Sections 15.x. x and 22.x.x). 

12.2.1 The Naming of Fatty Acids 

Fatty acids are hydrocarbon chains of various lengths and degrees of unsat- 
uration that terminate with carboxylic acid groups. The systematic name 


Fatty Acids 



Palmitate 

(ionized form of palmitic acid) 



Oleate 

(ionized fornn of oleie acid) 



FIGURE 12.2 Struetures of two fatty 
acids. Palmitate is a 16-carbon, saturated 
fatty acid, and oleate is an 18-carbon fatty 
acid with a single cis double bond. 


for a fatty acid is derived from the name of its parent hydrocarbon by the 
substitution of oic for the final e. For example, the Cig saturated fatty acid 
is called octadecanoic acid because the parent hydrocarbon is octadecane. A 
Cl 8 fatty acid with one double bond is called octadecenoic acid; with two 
double bonds, octadecadienoic acid; and with three double bonds, octa- 
decditrienoic acid. The notation 18:0 denotes a Ci 8 fatty acid with no double 
bonds, whereas 18:2 signifies that there are two double bonds. The strue¬ 
tures of the ionized forms of two common fatty acids—palmitic acid (Cie, 
saturated) and oleie acid (Ci 8 , monounsaturated)—are shown in Figure 12.2. 

Fatty acid carbon atoms are numbered starting at the carboxyl terminus, 
as shown in the margin. Carbon atoms 2 and 3 are often referred to as a and 
P, respectively. The methyl carbon atom at the distai end of the chain is called 
the (jú-carbon atom. The position of a double bond is represented by the Sym¬ 
bol A followed by a superseript number. For example, ds-means that there 
is a cis double bond between carbon atoms 9 and 10; trans-A^ means that 
there is a trans double bond between carbon atoms 2 and 3. Alternatively, 
the position of a double bond can be denoted by counting from the distai 
end, with the (o-carbon atom (the methyl carbon) as number 1 . An ( 0-3 fatty 
acid, for example, has the strueture shown in the margin. Fatty acids are ion¬ 
ized at physiological pH, and so it is appropriate to refer to them aceording 
to their carboxylate form: for example, palmitate or hexadecanoate. 

12.2.2 Fatty Acids Vary in Chain Length and Degree 
of Unsaturation 

Fatty acids in biological systems usually contain an even number of carbon 
atoms, typically between 14 and 24 (Table 12.1). The 16- and 18-carbon 
fatty acids are most common. The hydrocarbon chain is almost invariably 
unbranched in animal fatty acids. The alkyl chain may be saturated or it 
may contain one or more double bonds. The configuration of the double 
bonds in most unsaturated fatty acids is cis. The double bonds in polyun- 
saturated fatty acids are separated by at least one methylene group. 

The properties of fatty acids and of lipids derived from them are 
markedly dependent on chain length and degree of saturation. Unsaturated 
fatty acids have lower melting points than saturated fatty acids of the same 
length. For example, the melting point of stearic acid is 69.6°C, whereas 
that of oleie acid (which contains one cis double bond) is 13.4°C. The melt¬ 
ing points of polyunsaturated fatty acids of the Ci 8 series are even lower. 
Chain length also affects the melting point, as illustrated by the fact that 
the melting temperature of palmitic acid (C^ 5 ) is 6.5 degrees lower than that 
of stearic acid (C 18 ). Thus, short chain length and unsaturation enhance the 
fluidity of fatty acids and of their derivatives. 
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TABLE 12.1 Some naturally occurring fatty acids in animais 


Number of 
carbons 

Number of 
double bonds 

Common name 

Systematic name 

Formula 

12 

0 

Laurate 

n-Dodecanoate 

CH3(CH2)ioCOO" 

14 

0 

Myristate 

n - Tetradecanoate 

CH3(CH2)i2COO" 

16 

0 

Palmitate 

n - Hexadecanoate 

CH3(CH2)i4COO“ 

18 

0 

Stearate 

n-Octadecanoate 

CH3(CH2)i6COO- 

20 

0 

Arachidate 

n-Eicosanoate 

CH3(CH2)i8COO" 

22 

0 

Behenate 

n-Docosanoate 

CH3(CH2)2oCOO" 

24 

0 

Lignocerate 

n-Tetracosanoate 

CH3(CH2)22C00“ 

16 

1 

Palmitoleate 

cís - - Hexadecenoate 

CH3(CH2)5CH=CH(CH2)7C00“ 

18 

1 

Oleate 

cís - - Octadecenoate 

CH3(CH2)7CH=CH(CH2)7C00" 

18 

2 

Linoleate 

cis, ds-A^,A^^- 
Octadecadienoate 

CH3(CH2)4(CH=CHCH2)2(CH2)6C00" 

18 

3 

Linolenate 

all-as-A",A'^A''- 

Octadecatrienoate 

CH3CH2(CH=CHCH2)3(CH2)5C00" 

20 

4 

Arachidonate 

all-a•s-A^A^A'^A'L 

Eicosatetraenoate 

CH3(CH2)4(CH=CHCH2)4(CH2)2C00“ 


12.3 THERE ARE THREE COMIVION TYPES 
OF MEIVIBRANE LIPIDS 

Lipids differ markedly from the other groups of biomolecules considered 
thus far. By definition, lipids are water-insoluble biomolecules that are highly 
soluble in organic solvents such as chloroform. Lipids have a variety of bio- 
logical roles: they serve as fuel molecules, highly concentrated energy Stores, 
signal molecules, and components of membranes. The first three roles of 
lipids will be discussed in later chapters. Here, our focus is on lipids as mem- 
brane constituents. The three major kinds of membrane lipids are phospho- 
lipids, glycolipids, and cholesterol. We begin with lipids found in eukaryotes 
and bactéria. The lipids in archaea are distinct, although they have many 
features related to their membrane-forming function in common with lipids 
of other organisms. 



FIGURE 12.3 Schematíc structure of a 
phospholípid. 


12.3.1 Phospholipids Are the Major Class of Membrane Lipids 

Phospholipids are abundant in all biological membranes. A phospholipid 
molecule is constructed from four components: fatty acids, a platform to 
which the fatty acids are attached, a phosphate, and an alcohol attached to 
the phosphate (Figure 12.3). The fatty acid components provide a hy- 
drophobic barrier, whereas the remainder of the molecule has hydrophilic 
properties to enable interaction with the environment. 

The platform on which phospholipids are built may be glycerol, a 3- 
carbon alcohol, or sphingosine, a more complex alcohol. Phospholipids de- 
rived from glycerol are called phosphoglycerides. A phosphoglyceride consists 
of a glycerol backbone to which two fatty acid chains (whose characteristics 
were described in Section 12.2.2) and a phosphorylated alcohol are attached. 

In phosphoglycerides, the hydroxyl groups at C-1 and C-2 of glycerol 
are esterified to the carboxyl groups of the two fatty acid chains. The C-3 
hydroxyl group of the glycerol backbone is esterified to phosphoric acid. 
When no further additions are made, the resulting compound is phosphati- 
date (diacylglycerol 3-phosphate), the simplest phosphoglyceride. Only small 


















Acyl grou ps 
with fatty acid 
hydrocarbon chains 



Phosphatidate 
(Diacylglycerol 3-phosphate) 


FIGURE 12.4 Structure of 
phosphatidate (diacylglycerol 
3-phosphate). The absolute 
configuration of the center carbon 
(C-2) is shown. 


amounts of phosphatidate are present in membranes. However, the mole- 
cule is a key intermediate in the biosynthesis of the other phosphoglycerides 
(Section 26.x.x). The absolute configuration of the glycerol 3-phosphate 
moiety of membrane lipids is shown in Figure 12.4. 

The major phosphoglycerides are derived from phosphatidate by the for- 
mation of an ester bond between the phosphate group of phosphatidate and 
the hydroxyl group of one of several alcohols. The common alcohol moi- 
eties of phosphoglycerides are the amino acid serine, ethanolamine, choline, 
glycerol, and the inositol. 



Serine Ethanolamine Choline Glycerol 


The structural formulas of phosphatidyl choline and the other principal 
phosphoglycerides—namely, phosphatidyl ethanolamine, phosphatidyl 
serine, phosphatidyl inositol, and diphosphatidyl glycerol—are given in 
Figure 12.5. 

Sphingomyelin is a phospholipid found in membranes that is not derived 
from glycerol. Instead, the backbone in sphingomyelin is sphingosine, an 


Ri\ /O. 

XH2 

o_ 


.NH 3 + 


O 


o 


Phosphatidyl serine 


H 


COO" 


Rw O 


XH 2 

o o — è-H o CHj 

= |; - Hz 4 ./ 

II 

o \q H 2 CH 3 

Phosphatidyl choline 


Ri\ /O 

C XH 2 

_ o /O—Ç-H p 

- Hz 

fí H r jp /NH3^ 

1 


Phosphatidyl ethanolamine 




C" AHz 
O O^C-H O 

C ^ li - ^ H 


O 


"o 


o 



( 


HO 




Phosphatidyl inositol 






"CHz 


O O—C-H p 

i li - 

II 


o 


o 


/O .Rj 

HzC^ 

1 II 

|i- ""'i. 

r 11 


4^ 


H\ 


OH 


O 


Diphosphatidyl glycerol (cardiolipin) 


-1 323 I- 

Types of Membrane Lipids 



Inositol 


OH 


FIGURE 12.5 Some common 
phosphoglycerides found in 
membranes. 




Sphingosine 
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FIGURE 12.6 Structures of sphingosine and sphingomyelin. The sphingosine moiety 
of sphingomyelin is highlighted in blue. 



FIGURE 12.7 An archaeon and its 
environment. Archaea can thrive in 
habitats as harsh as a volcanic vent. Here, 
the archaea form an orange mat 
surrounded by yellow sulfurous deposits. 
[Krafft-Explore r/Photo Researchers.] 


amino alcohol that contains a long, unsaturated hydrocarbon chain (Figure 
12 .6). In sphingomyelin, the amino group of the sphingosine backbone is 
linked to a fatty acid by an amide bond. In addition, the primary hydroxyl 
group of sphingosine is esterified to phosphoryl choline. 

12.3.2 Archaeal Membranes Are Built from Ether Lipids 
with Branched Chains 

The membranes of archaea differ in composition from those of eukaryotes 
or bactéria in three important ways. Two of these differences clearly relate 
to the hostile living conditions of many archaea (Figure 12.7). First, the non- 
polar chains are joined to a glycerol backbone by ether rather than ester link- 
ages. The ether linkage is more resistant to hydrolysis. Second, the alkyl 
chains are branched rather than linear. They are built up from repeats of a 
fully saturated five-carbon fragment. These branched, saturated hydrocar- 
bons are more resistant to oxidation. The ability of archaeal lipids to resist 
hydrolysis and oxidation may help these organisms to withstand the extreme 
conditions, such as high temperature, low pH, or high salt concentration, 
under which some of these archaea grow. Finally, the stereochemistry of the 
central gly cerol is inverted compared with that shown in Figure 12.4. 



Membrane lipid from the archaeon Methanococcus jannaschii 


12.3.3 Membrane Lipids Can Aiso Include Carbohydrate Moieties 

Glycolipids, as their name implies, are sugar-containing lipids. Like sphin¬ 
gomyelin, the glycolipids in animal cells are derived from sphingosine. The 
amino group of the sphingosine backbone is acylated by a fatty acid, as in 
sphingomyelin. Glycolipids differ from sphingomyelin in the identity of the 
unit that is linked to the primary hydroxyl group of the sphingosine back¬ 
bone. In glycolipids, one or more sugars (rather than phosphoryl choline) 






are attached to this group. The simplest glycolipid, called a cerebroside, con- 
tains a single sugar residue, either glucose or galactose. 


O 



Sugar unit 
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More complex glycolipids, such as gangliosides, contain a branched chain of 
as many as seven sugar residues. Glycolipids are oriented in a completely 
asymmetric fashion with the sugar residues always on the extracellular side 
of the membrane. 

12.3.4 Cholesterol Is a Lipid Based on a Steroid Nucleus 

Cholesterol is a lipid with a structure quite different from that of phospho- 
lipids. It is a steroid, built from four linked hydrocarbon rings. 



A hydrocarbon tail is linked to the steroid at one end, and a hydroxyl group 
is attached at the other end. In membranes, the molecule is oriented 
parallel to the fatty acid chains of the phospholipids, and the hydroxyl 
group interacts with the nearby phospholipid head groups. Cholesterol 
is absent from prokaryotes but is found to varying degrees in virtually 
all animal membranes. It constitutes almost 25% of the membrane 
lipids in certain nerve cells but is essentially absent from some intracellu- 
lar membranes. 

12.3.5 A Membrane Lipid Is an Amphipathic Molecule 
Containing a Hydrophilic and a Hydrophobic Moiety 

The repertoire of membrane lipids is extensive, perhaps even bewildering, 
at first sight. However, they possess a criticai common structural theme: 
membrane lipids are amphipathic molecules (amphiphilic molecules). A mem¬ 
brane lipid contains both a hydrophilic and a hydrophobic moiety. 

Let us look at a model of a phosphoglyceride, such as phosphatidyl 
choline. Its overall shape is roughly rectangular (Figure 12.8A). The two 
hydrophobic fatty acid chains are approximately parallel to each other, 
whereas the hydrophilic phosphoryl choline moiety points in the opposite 
direction. Sphingomyelin has a similar conformation, as does the archaeal 
lipid depicted. Therefore, the following shorthand has been adopted to rep- 
resent these membrane lipids: the hydrophilic unit, also called the po/ar head 
group, is represented by a circle, whereas the hydrocarbon tails are depicted 
by straight or wavy lines (Figure 12.8B). 
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FIGURE 12.9 Diagram of a sectíon of 
a micelle. lonized fatty acids readily form 
such structures, but most phospholipids 
do not. 



FIGURE 12.10 Diagram of a section of 
a bilayer membrane. 



Phosphoglyceride 



Sphingomyelin 



Archaeal lipid 


FIGURE 12.8 Representatíons of 
membrane lipids. (A) Space-fíliing models 
of a phosphoglyceride, sphingomyelin, and 
an archaeal lipid show their shapes and 
distribution of hydrophilic and hydrophobic 
moieties. (B) A shorthand depiction of a 
membrane lipid. 



Shorthand depiction 


12.4 PHOSPHOLIPIDS AND GLYCOLIPIDS READILY 
FORM BIMOLECULAR SHEETS IN AQUEOUS MEDIA 

What properties enable phospholipids to form membranes? Membrane for- 
mation is a consequence of the amphipathic nature of the molecules. Their po¬ 
lar head groups favor contact with water, whereas their hydrocarbon tails 
interact with one another, in preference to water. How can molecules with 
these preferences arrange themselves in aqueous Solutions? One way is to 
form a micelle, a globular structure in which polar head groups are sur- 
rounded by water and hydrocarbon tails are sequestered inside, interacting 
with one another (Figure 12.9). 

Alternatively, the strongly opposed preferences of the hydrophilic and 
hydrophobic moieties of membrane lipids can be satisfied by forming a lipid 
bilayer, composed of two lipid sheets (Figure 12.10). A lipid bilayer is also 
called a bimolecular sheet. The hydrophobic tails of each individual sheet in¬ 
teract with one another, forming a hydrophobic interior that acts as a per- 
meability barrier. The hydrophilic head groups interact with the aqueous 
médium on each side of the bilayer. The two opposing sheets are called 
leaflets. 

The favored structure for most phospholipids and glycolipids in aqueous me¬ 
dia is a bimolecular sheet rather than a micelle. The reason is that the two 
fatty acyl chains of a phospholipid or a glycolipid are too bulky to fit into 
the interior of a micelle. In contrast, salts of fatty acids (such as sodium 
palmitate, a constituent of soap), which contain only one chain, readily form 
micelles. The formation of bilayers instead of micelles by phospholipids is of 
criticai biological importance. A micelle is a limited structure, usually less 
than 20 nm (200 À) in diameter. In contrast, a bimolecular sheet can have 
macroscopic dimensions, such as a millimeter (10^ nm, or 10^ A). Phos¬ 
pholipids and related molecules are important membrane constituents be- 
cause they readily form extensive bimolecular sheets (Figure 12.11). 

The formation of lipid bilayers is a self-assembly process. In other words, 
the structure of a bimolecular sheet is inherent in the structure of the con¬ 
stituent lipid molecules. The growth of lipid bilayers from phospholipids is 
a rapid and spontaneous process in water. Hydrophobic interactions are the 






























major driving force for the formation of lipid bilayers. Recall that hydropho- 
bic interactions also play a dominant role in the folding of proteins (Sections 
1.3.4 and 3.4) and in the stacking of bases in nucleic acids (Section 5.2.1). 
Water molecules are released from the hydrocarbon tails of membrane lipids 
as these tails become sequestered in the nonpolar interior of the bilayer. Fur- 
thermore, van der Waals attractive forces between the hydrocarbon tails favor 
dose packing of the tails. Finally, there are electrostatic and hydrogen-bond- 
ing attractions between the polar head groups and water molecules. Thus, lipid 
bilayers are stabilized by the full array of forces that mediate molecular in¬ 
teractions in biological systems. 

Because lipid bilayers are held together by many reinforcing, noncova- 
lent interactions (predominantly hydrophobic), they are cooperative struc- 
tures. These hydrophobic interactions have three significant biological 
consequences: (1) lipid bilayers have an inherent tendency to be extensive; 

(2) lipid bilayers will tend to dose on themselves so that there are no edges 
with exposed hydrocarbon chains, and so they form compartments; and 

(3) lipid bilayers are self-sealing because a hole in a bilayer is energetically 
unfavorable. 


FIGURE 12.11 Space-filling model 
of a section of phospholipid bilayer 
membrane. 


12.4.1 Lipid Vesicles Can Be Formed from Phospholipids 

The propensity of phospholipids to form membranes has been used to cre- 
ate an important experimental and clinicai tool. Lipid vesicles, or liposomes, 
aqueous compartments enclosed by a lipid bilayer (Figure 12.12), can be 
used to study membrane permeability or to deliver Chemicals to cells. Li- 
posomes are formed by suspending a suitable lipid, such as phosphatidyl 
choline, in an aqueous médium, and then sonicating (i.e., agitating by high- 
frequency sound waves) to give a dispersion of closed vesicles that are quite 
uniform in size. Vesicles formed by these methods are nearly spherical in 
shape and have a diameter of about 50 nm (500 A). Larger vesicles (of the 
order of 1 fxm, or lO"^ A, in diameter) can be prepared by slowly evaporat- 
ing the organic solvent from a suspension of phospholipid in a mixed sol- 
vent System. 

lons or molecules can be trapped in the aqueous compartments of 
lipid vesicles by forming the vesicles in the presence of these substances 
(Figure 12.13). For example, 50-nm-diameter vesicles formed in a 0.1 M 



FIGURE 12.12 Líposome. A liposome, 
or lipid vesicle, is a small aqueous 
compartnnent surrounded by a lipid bilayer. 
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FIGURE 12.13 Preparation of glycine- 
containíng liposomes. Liposomes 
containing glycine are formed by 
sonication of phospholipids in the 
presence of glycine. Free glycine is 
removed by gel filtration. 



FIGURE 12.14 Experimental 
arrangement for the study of planar 
bilayer membrane. A bilayer membrane 
is formed across a 1-mm hole in a septum 
that separates two aqueous compartments. 
This arrangement permits measurements 
of the permeability and electrical 
conductance of lipid bilayers. 


glycine solution will trap about 2000 molecules of glycine in each inner 
aqueous compartment. These glycine-containing vesicles can be separated 
from the surrounding solution of glycine by dialysis or by gel-filtration 
chromatography (Section 4.1.3). The permeability of the bilayer membrane 
to glycine can then be determined by measuring the rate of efflux of glycine 
from the inner compartment of the vesicle to the ambient solution. Specific 
membrane proteins can be solubilized in the presence of detergents and then 
added to the phospholipids from which liposomes will be formed. Protein- 
liposome complexes provide valuable experimental tools for examining a 
range of membrane -protein functions. 

T Experiments are underway to develop clinicai uses for liposomes. For 
example, liposomes containing drugs or DNA for gene-therapy ex- 
periments can be injected into patients. These liposomes fuse with the 
plasma membrane of many kinds of cells, introducing into the cells the mol¬ 
ecules that they contain. Drug delivery with liposomes also alters the 
distribution of a drug within the body and often lessens its toxicity. For 
instance, less of the drug is distributed to normal tissues because long- 
circulating liposomes have been shown to concentrate in regions of increased 
blood circulation, such as solid tumors and sites of inflammation. More- 
over, the selective fusion of lipid vesicles with particular kinds of cells is a 
promising means of controlling the delivery of drugs to target cells. 

Another well-defined synthetic membrane is 2 i planar bilayer membrane. 
This structure can be formed across a 1 -mm hole in a partition between two 
aqueous compartments by dipping a fine paintbrush into a membrane- 
forming solution, such as phosphatidyl choline in decane. The tip of the 
brush is then stroked across a hole (1 mm in diameter) in a partition be¬ 
tween two aqueous media. The lipid film across the hole thins spontaneously 
into a lipid bilayer. The electrical conduction properties of this macroscopic 
bilayer membrane are readily studied by inserting electrodes into each aque¬ 
ous compartment (Figure 12.14). For example, its permeability to ions is 
determined by measuring the current across the membrane as a function of 
the applied voltage. 

12.4.2 Lipid Bilayers Are Highly Impermeable to Ions 
and Most Polar Molecules 

The results of permeability studies of lipid vesicles and electrical-conduc¬ 
tance measurements of planar bilayers have shown that lipid bilayer mem- 
branes have a very low permeability for ions and most polar molecules. Water 
is a conspicuous exception to this generalization; it readily traverses such 
membranes because of its small size, high concentration, and lack of a com¬ 
plete charge. The range of measured permeability coefficients is very wide 
(Figure 12.15). For example, Na^ and traverse these membranes 10^ 
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FIGURE 12.15 Permeability coefficients (P) of ions and molecules in a lipid 
bilayer. The ability of molecules to cross a lipid bilayer spans a wide range of values. 
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membrane 10^ times as slowly as does indole, a structurally related mole- Membrane Proteins 

cule that lacks ionic groups. In fact, the permeability coefficients of small mol- 

ecules are correlated with their solubility in a nonpolar solvent relative to their 

solubility in water. This relation suggests that a small molecule might tra- 

verse a lipid bilayer membrane in the following way: first, it sheds its sol- 

vation shell of water; then, it becomes dissolved in the hydrocarbon core of 

the membrane; finally, it diffuses through this core to the other side of the 

membrane, where it becomes resolvated by water. An ion such as Na^ tra- 

verses membranes very slowly because the replacement of its coordination 

shell of polar water molecules by nonpolar interactions with the membrane 

interior is highly unfavorable energetically. 


12.5 PROTEINS CARRY OUT MOST MEMBRANE 
PROCESSES 


We now turn to membrane proteins, which are responsible for most of the 
dynamic processes carried out by membranes. Membrane lipids form a per¬ 
meability barrier and thereby establish compartments, whereas specific pro¬ 
teins mediate nearly all other membrane functions. In particular, proteins 
transport Chemicals and Information across a membrane. Membrane lipids 
create the appropriate environment for the action of such proteins. 

Membranes differ in their protein content. Myelin, a membrane that 
serves as an insulator around certain nerve fibers, has a low content of pro¬ 
tein (18%). Relatively pure lipids are well suited for insulation. In contrast, 
the plasma membranes or exterior membranes of most other cells are much 
more active. They contain many pumps, channels, receptors, and enzymes. 
The protein content of these plasma membranes is typically 50%. Energy- 
transduction membranes, such as the internai membranes of mitochondria 
and chloroplasts, have the highest content of protein, typically 75%. 

The protein components of a membrane can be readily visualized by 
SDS-polyacrylamide gel electrophoresis. As discussed earlier (Section 4.1.4), 
the electrophoretic mobility of many proteins in SDS-containing gels de- 
pends on the mass rather than on the net charge of the protein. The gel- 
electrophoresis patterns of three membranes—the plasma membrane of ery- 
throcytes, the photoreceptor membrane of retinal rod cells, and the 
sarcoplasmic reticulum membrane of muscle—are shown in Figure 12.16. 
It is evident that each of these three membranes contains many proteins but 
has a distinct protein composition. In general, membranes performing differ- 
ent functions contain different repertoires of proteins. 
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FIGURE 12.16 SDS-acrylamíde gel 
patterns of membrane proteins. 

(A) The plasma membrane of erythrocytes. 

(B) The photoreceptor membranes of 
retinal rod cells. (C) The sarcoplasmic 
reticulum membrane of muscle cells. 
[Courtesy of Dr. Theodore Steck (Part A) and 
Dr. David MacLennan (Part C).] 


12.5.1 Proteins Associate with the Lipid Bilayer in a 
Variety of Ways 

The ease with which a protein can be dissociated from a membrane indicates 
how intimately it is associated with the membrane. Some membrane proteins 
can be solubilized by relatively mild means, such as extraction by a solution 
of high ionic strength (e.g., 1 M NaCl). Other membrane proteins are bound 
much more tenaciously; they can be solubilized only by using a detergent or 
an organic solvent. Membrane proteins can be classified as being either per- 
ipheral or integral on the basis of this difference in dissociability (Figure 
12.17). Integral membrane proteins interact extensively with the hydrocarbon 
chains of membrane lipids, and so only agents that compete for these non¬ 
polar interactions can release them. In fact, most integral membrane proteins 
span the lipid bilayer. In contrast, peripheral membrane proteins are bound to 
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FIGURE 12.17 Integral and perípheral 
membrane proteíns. Integral membrane 
proteins (a, b, and c) interact extensively 
with the hydrocarbon region of the bilayer. 
Nearly all known integral membrane proteins 
traverse the lipid bilayer. Peripheral 
membrane proteins (d and e) bind to the 
surfaces of integral proteins. Some peripheral 
membrane proteins interact with the polar 
head groups of the lipids (not shown). 


membranes primarily by electrostatic and hydrogen-bond interactions with 
the head groups of lipids. These polar interactions can be disrupted by adding 
salts or by changing the pH. Many peripheral membrane proteins are bound 
to the surfaces of integral proteins, on either the cytosolic or the extracellu- 
lar side of the membrane. Others are anchored to the lipid bilayer by a co- 
valently attached hydrophobic chain, such as a fatty acid. 



FIGURE 12.18 Structure of 
bacteriorhodopsin. Bacteriorhodopsin 
consists largely of membrane-spanning 
a helices. (A) View through the membrane 
bilayer. The interior of the membrane is 
green and the head groups are red. 

(B) A view from the cytoplasmic side of 
the membrane. 


12.5.2 Proteins Interact with Membranes in a Variety of Ways 

Although membrane proteins are more difficult to purify and crystallize 
than are water-soluble proteins, researchers using x-ray crystallographic or 
electron microscopic methods have determined the three- dimensional 
structures of more than 20 such proteins at sufficiently high resolution to 
discern the molecular details. As noted in Chapter 3, the structures of mem¬ 
brane proteins differ from those of soluble proteins with regard to the dis- 
tribution of hydrophobic and hydrophilic groups. We will consider the 
structures of three membrane proteins in some detail. 

Proteins Can Span the Membrane with Alpha Helices. The first 
membrane protein that we consider is the archaeal protein bacteriorhodopsin, 
shown in Figure 12.18. This protein acts in energy transduction by using 
light energy to transport protons from inside the cell to outside. The pro- 
ton gradient generated in this way is used to form ATP. Bacteriorhodopsin 
is built almost entirely of a helices; seven closely packed a helices, arranged 
almost perpendicularly to the plane of the membrane, span its 45-A width. 
Examination of the primary structure of bacteriorhodopsin reveals that most 
of the amino acids in these membrane-spanning a helices are nonpolar and 
only a very few are charged (Figure 12.19). This distribution of nonpolar 
amino acids is sensible because these residues are either in contact with the 
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hydrocarbon core of the membrane or with one another. Membrane-span- 
ning a helices are the most common structural motif in membrane proteins. As 
will be discussed in Section 12.5.4, such regions can often be detected from 
amino acid sequence Information alone. 


FIGURE 12.19 Amino add 
sequence of bacteríorhodopsín. 

The seven helical regions are high- 
lighted in yellow and the charged 
residues in red. 


AQ I T G IHPIEMIf I WLALGTALMGLGTLYFL VÍKl GMG V S MPMAIK KIFYAITTLVPAI 
IIAFTMYLSMLI LG Y G L TMV P F GGHQN P I YWA[R|Y ABW L FTTPLLLLHLALLV 

■ A PIPG tila L VG AMG I M I G T G L VGÃTl TMv nrS YIRIF VWWA I S T A A M L Y I L Y~V] 

L F FG F T SWAigSMWPHV lAS T FlKlV LWN V T V V L W S A Y V V VWni G SHG A G I V P[L| 

IN IIEIT L L FMV LIIIV S AIKIV G FG L I L LlW SllA I F GHAflA P EflS ABG A A A T S 


A Channel Protein Can Be Formed from Beta Strands. Porin, a 
protein from the outer membranes of bactéria such as E. coli and Rhodo- 
bacter capsulatus, represents a class of membrane proteins with a completely 
different type of structure. Structures of this type are built from |3 strands 
and contain essentially no a helices (Figure 12.20). 





FIGURE 12.20 Structure of 
bacterial porin (from Rhodopseudomonas 
blastíca). Porin is a membrane protein 
built entirely of beta strands. (A) Side view. 
(B) View from the periplasmic space. Only 
one monomer of the trimeric protein is 
shown. 


The arrangement of |3 strands is quite simple: each strand is hydrogen 
bonded to its neighbor in an antiparallel arrangement, forming a single j8 
sheet. The |3 sheet curls up to form a hollow cylinder that functions as the 
active unit. As its name suggests, porin forms pores, or channels, in the 
membranes. A pore runs through the center of each cylinder-like protein. 
The outside surface of porin is appropriately nonpolar, given that it inter- 
acts with the hydrocarbon core of the membrane. In contrast, the inside of 
the channel is quite hydrophilic and is filled with water. This arrangement 
of nonpolar and polar surfaces is accomplished by the alternation of hy- 
drophobic and hydrophilic amino acids along each |3 strand (Figure 12.21). 



FIGURE 12.21 Amino add sequence 
of a porin. Some membrane proteins 
such as porins are built from p strands 
that tend to have hydrophobic and 
hydrophilic amino acids in adjacent 
positions. The secondary structure of 
Rhodopseudomonas blastica is shown, 
with the diagonal lines indicating the 
direction of hydrogen bonding along the 
p sheet. Flydrophobic residues (F, I, L, M, 
V, W, and Y) are shown in yellow. These 
residues tend to lie on the outside of the 
structure, in contact with the hydrophobic 
core of the membrane. 
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FIGURE 12.22 Formation of 
prostaglandin H 2 . Prostaglandin H 2 
synthase-1 catalyzes the formation of 
prostaglandin H 2 from arachidonic add 
in two steps. 


Embedding Part of a Protein in a Membrane Can Link It to the 
Membrane Surface. The structure of the membrane-bound enzyme prosta¬ 
glandin H 2 synthase-1 reveals a rather different role for a helices in protein- 
membrane associations. This enzyme catalyzes the conversion of arachidonic 
acid into prostaglandin H 2 in two steps: a cyclooxygenase reaction and a per- 
oxidase reaction (Figure 12.22). Prostaglandin H 2 promotes inflammation 
and modulates gastric acid secretion. The enzyme that produces prostaglandin 
H 2 is a homodimer with a rather complicated structure consisting primarily 
of a helices. Unlike bacteriorhodopsin, this protein is not largely embedded 
in the membrane but, instead, lies along the outer surface of the membrane 
firmly bound by a set of a helices with hydrophobic surfaces that extend from 
the bottom of the protein into the membrane (Figure 12.23). This linkage is 
sufficiently strong that only the action of detergents can release the protein 
from the membrane. Thus, this enzyme is classified as an integral membrane 
protein, although it is not a membrane-spanning protein. 



Hydrophobic amino acid 
side chains 



FIGURE 12.23 Attachment of prostaglandin H 2 synthase-1 to the membrane. 

Prostaglandin H 2 synthase-1 is held in the membrane by a set of a helices coated with 
hydrophobic side chains. One monomer of the dimeric enzyme is shown. 



FIGURE 12.24 Hydrophobic channel of prostaglandin H 2 synthase. A view 
of prostaglandin H 2 synthase from the membrane, showing the hydrophobic channel that 
leads to the active site. The membrane-anchoring helices are shown in orange. 



















T The localization of prostaglandin H 2 synthase-1 in the membrane is 
crucial to its function. The substrate for this enzyme, arachidonic acid, 
is a hydrophobic molecule generated by the hydrolysis of membrane lipids. 
Arachidonic acid reaches the active site of the enzyme from the membrane 
without entering an aqueous environment by traveling through a hydropho¬ 
bic channel in the protein (Figure 12.24). Indeed, nearly all of us have ex- 
perienced the importance of this channel: drugs such as aspirin and ibupro- 
fen block the channel and prevent prostaglandin synthesis by inhibiting the 
cyclooxygenase activity of the synthase. In the case of aspirin, the drug acts 
through the transfer of an acetyl group from the aspirin to a serine residue 
(Ser 530) that lies along the path to the active site (Figure 12.25). 
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(Acetylsalicyclic acid) 


Two important features emerge from our examination of these three ex- 
amples of membrane protein structure. First, the parts of the protein that 
interact with the hydrophobic parts of the membrane are coated with non- 
polar amino acid side chains, whereas those parts that interact with the aque¬ 
ous environment are much more hydrophilic. Second, the structures posi- 
tioned within the membrane are quite regular and, in particular, all 
backbone hydrogen-bond donors and acceptors participate in hydrogen 
bonds. Breaking a hydrogen bond within a membrane is quite unfavorable, be- 
cause little or no water is present to compete for the polar groups. 



O 


FIGURE 12.25 Aspírín's effects on 
prostaglandin H 2 synthase-1. Aspirin 
acts by transferring an acetyl group to 
a serine residue in prostaglandin H 2 
synthase-1. 


12.5.3 Some Proteins Associate with Membranes 
Through Covalentiy Attached Hydrophobic Groups 

The membrane proteins considered thus far associate with the membrane 
through surfaces generated by hydrophobic amino acid side chains. How- 
ever, even otherwise soluble proteins can associate with membranes if the 
association is mediated by hydrophobic groups attached to the proteins. 
Three such groups are shown in Figure 12.26: (1) a palmitoyl group attached 
to a specific cysteine residue by a thioester bond, (2) a farnesyl group at¬ 
tached to a cysteine residue at the carboxyl terminus, and (3) a glycolipid 
structure termed a glycosyl phosphatidyl inositol (GPI) anchor attached to 
the carboxyl terminus. These modifications are attached by enzyme systems 
that recognize specific signal sequences near the site of attachment. 





FIGURE 12.26 Membrane 
anchors. Membrane anchors 
are hydrophobic groups that are 
covalentiy attached to proteins 
(in blue) and tether the proteins 
to the membrane. The red and 
blue hexagons correspond to 
mannose and GlcNAc, respectively. 
R groups represent points of 
additional modification. 
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TABLE 12.2 Polarity scale for 
identifying transmembrane 
helices 


Amino 

acid 

residue 

Transfer free 
energy kcal mol~^ 
(kJ moP^) 

Phe 

3.7 

(15.5) 

Met 

3.4 

(14.3) 

Ile 

3.1 

(13.0) 

Leu 

2.8 

(11.8) 

Val 

2.6 

(10.9) 

Cys 

2.0 

(8.4) 

Trp 

1.9 

(8.0) 

Ala 

1.6 

(6.7) 

Thr 

1.2 

(5.0) 

Gly 

1.0 

(4.2) 

Ser 

0.6 

(2.5) 

Pro 

-0.2 

(-0.8) 

Tyr 

-0.7 

(-2.9) 

His 

-3.0 ( 

-12.6) 

Gin 

-4.1 ( 

-17.2) 

Asn 

-4.8 ( 

-20.2) 

Glu 

-8.2 ( 

-34.4) 

Lys 

-8.8 ( 

-37.0) 

Asp 

-9.2 ( 

-38.6) 

Arg 

-12.3 ( 

-51.7) 


Source: After D. M. Engelman, T. A. 
Steitz, and A. Goldman. Annu. Rev. 
Biophys. Biophys. Chem. 15(1986):330. 
Note: The free energies are for the transfer 
of an amino acid residue in an a helix 
from the membrane interior (assumed to 
have a dielectric constant of 2) to water. 


12.5.4 Transmembrane Helices Can Be Accurately Predicted 
from Amino Acid Sequences 

Many membrane proteins, like bacteriorhodopsin, employ a helices to span 
the hydrophobic part of a membrane. As noted earlier, typically most of 
the residues in these a helices are nonpolar and almost none of them are 
charged. Can we use this information to identify putative membrane-spanning 
regions from sequence data alone? One approach to identifying transmem¬ 
brane helices is to ask whether a postulated helical segment is likely to be 
most stable in a hydrocarbon milieu or in water. Specifically, we want to es- 
timate the free-energy change when a helical segment is transferred from 
the interior of a membrane to water. Free-energy changes for the transfer 
of individual amino acid residues from a hydrophobic to an aqueous envi- 
ronment are given in Table 12.2. For example, the transfer of a poly-L-argi- 
nine helix, a homopolymer of a positively charged amino acid, from the in¬ 
terior of a membrane to water would be highly favorable [ — 12.3 kcal mol~^ 
( — 51.5 kj moP^) per arginine residue in the helix], whereas the transfer of 


(A) Outside Bilayer Inside 




First amino acid residue 
in window 

FIGURE 12.27 Locatíng the membrane-spanníng helix of glycophorín. (A) Amino 
acid sequence and transmembrane disposition of glycophorin A from the red-cell membrane. 
Fifteen 0-linked carbohydrate units are shown as diamond shapes, and an A/-linked unit is 
shown as a lozenge shape. The hydrophobic residues (yellow) buried in the bilayer form a 
transmembrane a helix. The carboxyl-terminal part of the molecule, located on the cytosolic 
side of the membrane, is rich in negatively charged (red) and positively charged (blue) 
residues. (B) Hydropathy plot for glycophorin. The free energy for transfering a helix of 20 
residues from the membrane to water is plotted as a function of the position of the first 
residue of the helix in the sequence of the protein. Peaks of greater than +20 kcal mol“’ in 
hydropathy plots are indicative of potential transmembrane helices. [(A) Courtesy of Dr. Vincent 
Marchesi; (B) after D. M. Engelman, T. A. Steitz, and A. Goldman. Identifying nonpolar transbilayer helices 
in amino acid sequences of membrane proteins. Annu. Rev. Biophys. Biophys. Chem. 15(1986):343. 
Copyright © 1986 by Annual Reviews, Inc. All rights reserved.] 
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FIGURE 12.28 Hydropathy piot for 
porín. No strong peaks are obsen/ed for 
this intrinsic membrane protein because it 
is constructed from membrane-spanning 
p strands rather than a helices. 


a poly-L-phenylalanine helix, a homopolymer of a hydrophobic amino acid, 
would be unfavorable [ + 3.7 kcal moP^ ( + 15.5 kj moP^) per phenylala- 
nine residue in the helix]. 

The hydrocarbon core of a membrane is typically 30 A wide, a length that 
can be traversed by an a helix consisting of 20 residues. We can take the 
amino acid sequence of a protein and estimate the free-energy change that 
takes place when a hypothetical a helix formed of residues 1 through 20 is 
transferred from the membrane interior to water. The same calculation can 
be made for residues 2 through 21,3 through 22, and so forth, until we reach 
the end of the sequence. The span of 20 residues chosen for this calculation 
is called a window. The free-energy change for each window is plotted against 
the first amino acid at the window to create a hydropathy plot. Empirically, a 
peak of + 20 kcal moP^ ( + 84 kJ moP^) or more in a hydropathy plot based 
on a window of 20 residues indicates that a polypeptide segment could be a 
membrane-spanning a helix. For example, glycophorin, a protein found in 
the membranes of red blood cells, is predicted by this criterion to have one 
membrane-spanning helix, in agreement with experimental findings (Figure 
12.27). It should be noted, however, that a peak in the hydropathy plot does 
not prove that a segment is a transmembrane helix. Fven soluble proteins 
may have highly nonpolar regions. Conversely, some membrane proteins 
contain transmembrane structural features (such as membrane-spanning |3 
strands) that escape detection by these plots (Figure 12.28). 


12.6 LIPIDS AND MANY MEMBRANE PROTEINS 
DIFFUSE RAPIDLY IN THE PLANE OF THE MEMBRANE 


Biological membranes are not rigid, static structures. On the contrary, lipids 
and many membrane proteins are constantly in lateral motion, a process 
called lateral diffusion. The rapid lateral movement of membrane proteins has 
been visualized by means of fluorescence microscopy through the use of the 
technique of fluorescence recovery after photobleaching (FRAP; Figure 12.29). 


FIGURE 12.29 Fluorescence recovery 
after photobleaching (FRAP) technique. 

(A) The cell-surface fluoresces because 
of a labeled surface component. (B) The 
fluorescent molecules of a small part of 
the surface are bleached by an intense 
light pulse. (C) The fluorescence intensity 
recovers as bleached molecules diffuse out 
of the region and unbleached molecules 
diffuse into it. (D) The rate of recovery 
depends on the diffusion coefficient. 
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s = (4Dt)^/2 

The diffusion coefficient of lipids in a variety of membranes is about 

1 jam^ s~\ Thus, a phospholipid molecule diffuses an average distance of 

2 jütm in 1 s. This rate means that a lipid molecule can travei from one end of 
a bacterium to the other in a second. The magnitude of the observed diffu¬ 
sion coefficient indicates that the viscosity of the membrane is about 100 
times that of water, rather like that of olive oil. 

In contrast, proteins vary markedly in their lateral mobility. Some pro- 
teins are nearly as mobile as lipids, whereas others are virtually immobile. For 
example, the photoreceptor protein rhodopsin (Section 32.3.1), a very mo¬ 
bile protein, has a diffusion coefficient of 0.4 jixm^ s~^. The rapid move- 
ment of rhodopsin is essential for fast signaling. At the other extreme is fi- 
bronectin, a peripheral glycoprotein that interacts with the extracellular 
matrix. For fibronectin, D is less than 10~"^ jixm^ s~\ Fibronectin has a very 
low mobility because it is anchored to actin filaments on the inside of the 
plasma membrane through integrin, a transmembrane protein that links the 
extracellular matrix to the cytoskeleton. 

12.6.1 The Fluid Mosaic Model Allows Lateral Movement but 
Not Rotation Through the Membrane 

On the basis of the dynamic properties of proteins in membranes, S. 
Jonathan Singer and Garth Nicolson proposed the concept of di fluid mosaic 
model for the overall organization of biological membranes in 1972 (Figure 
12.30). The essence of their model is that membranes are two-dimensional So¬ 
lutions of oriented lipids and globular proteins. The lipid bilayer has a dual 
role: it is both a solvent for integral membrane proteins and a permeability 


First, a cell-surface component is specifically labeled with a fluorescent 
chromophore. A small region of the cell surface (^^3 juim^) is viewed through 
a fluorescence microscope. The fluorescent molecules in this region are then 
destroyed (bleached) by a very intense light pulse from a laser. The fluo¬ 
rescence of this region is subsequently monitored as a function of time by 
using a light levei sufficiently low to prevent further bleaching. If the la¬ 
beled component is mobile, bleached molecules leave and unbleached mol¬ 
ecules enter the illuminated region, which results in an increase in the flu¬ 
orescence intensity. The rate of recovery of fluorescence depends on the 
lateral mobility of the fluorescence-labeled component, which can be ex- 
pressed in terms of a diffusion coefficient, D. The average distance s tra- 
versed in time t depends on D according to the expression 



FIGURE 12.30 Fluid mosaic model. [After S. J. Singer and G. L Nicolson. Science 175(1972):723.] 













barrier. Membrane proteins are free to diffuse laterally in the lipid matrix 
unless restricted by special interactions. 

Although the lateral diffusion of membrane components can be rapid, 
the spontaneous rotation of lipids from one face of a membrane to the other 
is a very slow process. The transition of a molecule from one membrane sur- 
face to the other is called transverse diffusion oiflip-flop (Figure 12.31) The 
flip-flop of phospholipid molecules in phosphatidyl choline vesicles has 
been directly measured by electron spin resonance techniques, which show 
that a phospholipid molecule flip-flops once in several hours. Thus, a phos¬ 
pholipid molecule takes about 10^ times as long to flip-flop across a mem¬ 
brane as it takes to diffuse a distance of 50 A in the lateral direction. The 
free-energy barriers to flip-flopping are even larger for protein molecules 
than for lipids because proteins have more extensive polar regions. In fact, 
the flip-flop of a protein molecule has not been observed. Hence, membrane 
asymmetry can be preserved for long periods. 
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12.6.2 Membrane Fluidity Is Controlled by Fatty Acid 
Composition and Cholesterol Content 

Many membrane processes, such as transport or signal transduction, de- 
pend on the fluidity of the membrane lipids, which in turn depends on the 
properties of fatty acid chains, which can exist in an ordered, rigid State or 
in a relatively disordered, fluid State. The transition from the rigid to the 
fluid State occurs rather abruptly as the temperature is raised above T^, the 
melting temperature (Figure 12.32). This transition temperature depends on 
the length of the fatty acyl chains and on their degree of unsaturation (Table 
12.3). The presence of saturated fatty acyl residues favors the rigid State be¬ 
cause their straight hydrocarbon chains interact very favorably with each 
other. On the other hand, a cis double bond produces a bend in the hydrocar¬ 
bon chain. This bend interferes with a highly ordered packing of fatty acyl 
chains, and so T^ is lowered (Figure 12.33). The length of the fatty acyl chain 
also affects the transition temperature. Long hydrocarbon chains interact 
more strongly than do short ones. Specifically, each additional —CH 2 - group 
makes a favorable contribution of about —0.5 kcal moP^ ( — 2.1 kj moP^) 
to the free energy of interaction of two adjacent hydrocarbon chains. 

Bactéria regulate the fluidity of their membranes by varying the number of 
double bonds and the length of their fatty acyl chains. For example, the ratio 
of saturated to unsaturated fatty acyl chains in the E. coli membrane de- 
creases from 1.6 to 1.0 as the growth temperature is lowered from 42°C to 
27°C. This decrease in the proportion of saturated residues prevents the 
membrane from becoming too rigid at the lower temperature. 


FIGURE 12.31 Lipid movement in 
membranes. Lateral diffusion of lipids is 
much more rapid than transverse diffusion 
(flip-flop). 



Temperature —> 

FIGURE 12.32 The phase-transition, 
or melting, temperature (7^) for a 
phospholipid membrane. As the 

temperature is raised, the phospholipid 
membrane changes from a packed, 
ordered State to a more random one. 


I- 

TABLE 12.3 The melting temperature of phosphatidyl choline containing 
different pairs of identical fatty acid chains 


Number 

of 

carbons 

Number 
of double 
bonds 


Fatty acid 

(°C) 

Common 

name Systematic name 

22 

0 

Behenate 

n-Docosanote 

75 

18 

0 

Stearate 

n-Octadecanoate 

58 

16 

0 

Palmitate 

n - Hexadecanoate 

41 

14 

0 

Myristate 

n - Tetradecanoate 

24 

18 

1 

Oleate 

cis - - Octadecenoate 

-22 














































FIGURE 12.33 Packíng of fatty add 
chains ín a membrane. The highly 
ordered packing of fatty acid chains is 
disrupted by the presence of cis double 
bonds. The space-filling models show the 
packing of (A) three molecules of stearate 
(Ci8, saturated) and (B) a molecule of 
oleate (Cis, unsaturated) between two 
molecules of stearate. 



(A) 



In animais, cholesterol is the key regulator of membrane fluidity. Choles- 
terol contains a bulky steroid nucleus with a hydroxyl group at one end and 
a flexible hydrocarbon tail at the other end. Cholesterol inserts into bi- 
layers with its long axis perpendicular to the plane of the membrane. The 
hydroxyl group of cholesterol forms a hydrogen bond with a carbonyl oxy- 
gen atom of a phospholipid head group, whereas the hydrocarbon tail of 
cholesterol is located in the nonpolar core of the bilayer. The different shape 
of cholesterol compared with phospholipids disrupts the regular interac- 
tions between fatty acyl chains. In addition, cholesterol appears to form spe- 
cific complexes with some phospholipids. Such complexes may concentrate 
in specific regions within membranes. One result of these interactions is the 
moderation of membrane fluidity, making membranes less fluid but at the 
same time less subject to phase transitions. 



FIGURE 12.34 Asymmetry of the 
Na^-K^ transport system ín plasma 
membranes. The Na^-K^ transport 
System pumps Na^ out of the cell and 
into the cell. 


12.6.3 All Biological Membranes Are Asymmetric 

Membranes are structurally and functionally asymmetric. The outer and in- 
ner surfaces of all known biological membranes have different components and 
different enzymatic activities. A clear-cut example is the pump that regulates 
the concentration of Na^ and ions in cells (Figure 12.34). This trans¬ 
port protein is located in the plasma membrane of nearly all cells in higher 
organisms. The Na^-K^ pump is oriented so that it pumps Na^ out of the 
cell and into it. Furthermore, ATP must be on the inside of the cell to 
drive the pump. Ouabain, a specific inhibitor of the pump, is effective only 
if it is located outside. 

Membrane proteins have a unique orientation because they are synthe- 
sized and inserted into the membrane in an asymmetric manner. This ab- 
solute asymmetry is preserved because membrane proteins do not rotate 
from one side of the membrane to the other and because membranes are al- 
ways synthesized by the growth of preexisting membranes. Lipids, too, are 
asymmetrically distributed as a consequence of their mode of biosynthesis, 
but this asymmetry is usually not absolute, except for glycolipids. In the 
red-blood-cell membrane, sphingomyelin and phosphatidyl choline are 
preferentially located in the outer leaflet of the bilayer, whereas phos¬ 
phatidyl ethanolamine and phosphatidyl serine are located mainly in the in- 
ner leaflet. Large amounts of cholesterol are present in both leaflets. 


12.7 EUKARYOTIC CELLS CONTAIN COMPARTIVIENTS 
BOUNDED BY INTERNAL MEMBRANES 

Thus far we have considered only the plasma membrane of cells. Many bac¬ 
téria such as E. coli have two membranes separated by a cell wall (made of 
proteins, peptides, and carbohydrates) lying in between (Figure 12.35). The 
inner membrane acts as the permeability barrier, and the outer membrane and 




Internai Compartments 



CyLoplãsn^ 


the cell wall provide additional protection. The outer membrane is quite per- 
meable to small molecules owing to the presence of porins. The region be- 
tween the two membranes containing the cell wall is called theperiplasm. Other 
bactéria and archaea have only a single membrane surrounded by a cell wall. 

Eukaryotic cells, with the exception of plant cells, do not have cell walls, 
and their cell membranes consist of a single lipid bilayer. In plant cells, the 
cell wall is on the outside of the plasma membrane. Eukaryotic cells are dis- 
tinguished by the use of membranes inside the cell to form internai com¬ 
partments (Figure 12.36). For example, peroxisomes, organelles that play a 
major role in the oxidation of fatty acids for energy conversion, are defined 
by a single membrane. Mitochondria, the organelles in which ATP is syn- 
thesized, are surrounded by two membranes. Much like the case for a bac- 
terium, the outer membrane is quite permeable to small molecules, whereas 
the inner membrane is not. Indeed, considerable evidence now indicates that 
mitochondria evolved from bactéria by endosymbiosis (Section 18.12). A 
double membrane also surrounds the nucleus. However, the nuclear enve¬ 
lope is not continuous but, instead, consists of a set of closed membranes 
that come together at structures called nuclear pores. These pores regulate 
transport into and out of the nucleus. The nuclear membranes are linked to 
another membrane-defined structure, the endoplasmic reticulum, which 
plays a host of cellular roles, including drug detoxification and the modifi- 
cation of proteins for secretion (Section 11.4.1). Thus, a eukaryotic cell com- 
prises interacting compartments, and transport into and out of these com¬ 
partments is essential to many biochemical processes. 

12.7.1 Proteins Are Targeted to Specific Compartments 
by Signal Sequences 

The compartmentalization of eukaryotic cells makes possible many 
processes that must be separated from the remainder of the cellular envi- 
ronment to function properly. Specific proteins are found in peroxisomes, 
others in mitochondria, and still others in the nucleus. How do proteins end 
up in the proper compartment? Even for bactéria, some targeting of pro¬ 
teins is required: some proteins are secreted from the cell, whereas others 
remain in the cytosol. 

Proteins include specific sequences that serve as address labeis to direct 
the molecules to the proper location. For example, most peroxisomal pro¬ 
teins end with a sequence, Ser-Lys-Leu-COO~ , that acts as an autonomous 



FIGURE 12.35 Cell membranes of 
prokaryotes. A schematic view of the 
membrane in bacterial cells surrounded 
by (A) two membranes or (B) one 
membrane. 



FIGURE 12.36 Internai membranes of 
eukaryotes. Electron micrograph of a thin 
section of a hormone-secreting cell for the 
rat pituitary, showing the presence of 
internai structures bounded by membranes. 
[Biophoto Associates/Photo Researchers.] 
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FIGURE 12.37 A mítochondrial targetíng sequence. This sequence is recognized by 
receptors on the externai face of the outer mítochondrial membrane. A protein bearing the 
sequence wili be imported into the mitochondrion. Hydrophobic residues are shown in 
yellow, basic ones in blue, and serine and threonine in red. 




(B) 

FIGURE 12.38 Movement of a protein 
into the nucleus. Localization of 

(A) unmodified pyruvate kinase, and 

(B) pyruvate kinase containing a nuclear 
localization signal sequence attached to 
its amino terminus. The protein was 
visualized by fluorescence microscopy 
after staining with a specific antibody. 

[From W. D. Richardson, B. L. Roberts, and 

A. E. Snnith. Cell 44(1986):79.] 


targeting signal. The remo vai of this sequence from a protein that normally 
resides in peroxisomes blocks its import into that organelle, whereas the ad- 
dition of this sequence to a protein that normally resides in the cytosol can 
direct that protein to peroxisomes. A protein destined to pass through both 
mitochondrial membranes usually has a targeting sequence at its amino ter¬ 
minus (Figure 12.37). Unlike the peroxisomal targeting sequence, these 
amino-terminal sequences are highly variable; no clear consensus exists. 
They are typically from 15 to 35 residues long and rich in positively charged 
residues and in serines and threonines. Proteins destined for the nucleus have 
internai targeting sequences. A typical nuclear localization signal contains 
five consecutive positively charged residues such as Lys-Lys-Lys-Arg-Lys. 
The addition of such a sequence to a protein not found in the nucleus can 
direct it to the nucleus (Figure 12.38). Other sequences can direct proteins 
out of the nucleus. The known targeting sequences are given in Table 12.4. 


r 


TABLE 12.4 Targeting sequences 


Target 

Signal 

Nucleus 

-KKXK or -(K/R) 2 -X,o-i 2 -(K/R)* 

Peroxisome 

-SKL-COO“ 

Mitochondrion 

Aí-terminal amphipathic helix 

Endoplasmic reticulum 

-KDEL-COO~(ER retention) 


The ‘V’’ means that either K or R is required. 


Targeting sequences act by binding to specific proteins associated with 
each organelle. The determination of the structure of a protein, a-karyo- 
pherin, that binds to the nuclear localization signal reveals how the protein 
recognizes such a targeting sequence (Figure 12.39). A peptide containing 
the appropriate sequence binds to a specific site on the protein. The target 
peptide is held in an extended conformation through interactions between 
the target peptide backbone and asparagine side chains of the a-karyopherin 
while each of the basic residues lies in a deep pocket near the bottom, lined 
with negatively charged residues. Proteins that bind to the other targeting 
signal sequences presumably also have structures that allow recognition of 


FIGURE 12.39 Protein targeting 
signal recognition. The structure of the 
nuclear localization signal-binding protein 
a-karyopherin (also known as a-importin) 
with a nuclear localization signal peptide 
bound to its major recognition site. 


















the required features. Note that we have considered only how proteins are 
marked for different compartments. Later, we will consider the mechanisms 
by which proteins actually cross membranes (Section 29.x.x). 


Summary 


12.7.2 Membrane Budding and Fusion Underlie Several 
Important Biological Processes 

Membranes must be able to separate or join together to take up, transport, and 
release molecules. Many take up molecules through the process of receptor - 
mediated endocytosis (Figure 12.40). Here, a protein or larger complex initially 
binds to a receptor on the cell surface. After the protein is 
bound, specialized proteins act to cause the membrane in 
the vicinity of the bound protein to invaginate. The in- 
vaginated membrane eventually breaks off and fuses to 
form a vesicle. 

Receptor-mediated endocytosis plays a key role in cho- 
lesterol metabolism (Section 26.x.x). Some cholesterol in 
the blood is in the form of a lipid-protein complex called 
low-density lipoprotein (LDL). Low density lipoprotein 
binds to an LDL receptor, an integral membrane protein. 

The segment of the plasma membrane containing the 
LDL-LDL receptor complex then invaginates and buds 
off from the membrane. The LDL separates from the re¬ 
ceptor, which is recycled back to the membrane in a sep¬ 
arate vesicle. The vesicle containing the LDL fuses with 
a lysosome, an organelle containing an array of digestive 
enzymes. The cholesterol is released into the cell for stor- 
age or use in membrane biosynthesis, and the remaining 
protein components are degraded. Various hormones, 
transport proteins, and antibodies employ receptor- 
mediated endocytosis to gain entry into a cell. A less 
advantageous consequence is that this pathway is avail- 
able to viruses and toxins as a means of entry into cells. The reverse 
process—the fusion of a vesicle to a membrane—is a key step in the release 
of neurotransmitters from a neuron into the synaptic cleft (Figure 12.41). Al- 
though the processes of budding and fusion appear deceptively simple, the 
structures of the intermediates in the budding and fusing processes and the 
detailed mechanisms remain active areas of investigation. 



FIGURE 12.40 Receptor-mediated endocytosis. The process 
of receptor-mediated endocytosis is illustrated for the cholesterol- 
carrying complex, low-density lipoprotein (LDL): (1) LDL binds to 
a specific receptor, the LDL receptor; (2) this complex invaginates 
to form an internai vesicle; (3) after separation from its receptor, 
the LDL-containing vesicle fuses with a lysosome, leading to 
degradation of the LDL and release of the cholesterol. 


SUMMARY 


Many Common Features Underlie the Diversity of Biological Membranes 

Biological membranes are sheetlike structures, typically from 60 to 
100 À thick, that are composed of protein and lipid molecules held to¬ 
gether by noncovalent interactions. Membranes are highly selective per- 
meability barriers. They create closed compartments, which may be en- 
tire cells or organelles within a cell. Proteins in membranes regulate the 
molecular and ionic compositions of these compartments. Membranes 
also control the flow of Information between cells. 



Fatty Acids Are Key Constituents of Lipids 

Fatty acids are hydrocarbon chains of various lengths and degrees of un- 
saturation that terminate with a carboxylic acid group. The fatty acid 
chains in membranes usually contain between 14 and 24 carbon atoms; 
they may be saturated or unsaturated. Short chain length and unsatu- 
ration enhance the fluidity of fatty acids and their derivatives by lower- 
ing the melting temperature. 


FIGURE 12.41 Neurotransmitter 
release. Neurotransmitter-containing 
synaptic vesicles are arrayed near the 
plasma membrane of a nerve cell. 

Synaptic vesicles fuse with the plasma 
membrane, releasing the neurotransmitter 
into the synaptic cleft. [T. Reese/Don Fawcett/ 
Photo Researchers.] 
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and cholesterol. Phosphoglycerides, a type of phospholipid, consist of a 
glycerol backbone, two fatty acid chains, and a phosphorylated alcohol. 
Phosphatidyl choline, phosphatidyl serine, and phosphatidyl ethan- 
olamine are major phosphoglycerides. Sphingomyelin, a different type 
of phospholipid, contains a sphingosine backbone instead of glycerol. 
Glycolipids are sugar-containing lipids derived from sphingosine. Cho¬ 
lesterol, which modulates membrane fluidity, is constructed from a 
steroid nucleus. A common feature of these membrane lipids is that they 
are amphipathic molecules, having hydrophobic and hydrophilic ends. 

• Phospholipids and Glycolipids Readily Form Bimolecular Sheets 
in Aqueous Media 

Membrane lipids spontaneously form extensive bimolecular sheets in 
aqueous Solutions. The driving force for membrane formation is the 
hydrophobic interactions among the fatty acid tails of membrane lipids. 
The hydrophilic head groups interact with the aqueous médium. Lipid 
bilayers are cooper ative structures, held together by many weak bonds. 
These lipid bilayers are highly impermeable to ions and most polar mol¬ 
ecules, yet they are quite fluid, which enables them to act as a solvent 
for membrane proteins. 

• Proteins Carry Out Most Membrane Processes 

Specific proteins mediate distinctive membrane functions such as trans- 
port, communication, and energy transduction. Many integral mem¬ 
brane proteins span the lipid bilayer, whereas others are partly embed- 
ded in the membrane. Peripheral membrane proteins are bound to 
membrane surfaces by electrostatic and hydrogen-bond interactions. 
Membrane-spanning proteins have regular structures, including (3 
strands, although the a helix is the most common membrane-spanning 
domain. Indeed, sequences of 20 consecutive nonpolar amino acids can 
be diagnostic of a membrane-spanning a-helical region of a protein. 

I Lipids and Many Membrane Proteins Diffuse Rapidiy in the Plane 
of the Membrane 

Membranes are structurally and functionally asymmetric, as exempli- 
fied by the restriction of sugar residues to the externai surface of mam- 
malian plasma membranes. Membranes are dynamic structures in which 
proteins and lipids diffuse rapidly in the plane of the membrane (lateral 
diffusion), unless restricted by special interactions. In contrast, the ro- 
tation of lipids from one face of a membrane to the other (transverse dif¬ 
fusion, or flip-flop) is usually very slow. Proteins do not rotate across bi¬ 
layers; hence, membrane asymmetry can be preserved. The degree of 
fluidity of a membrane partly depends on the chain length of its lipids 
and the extent to which their constituent fatty acids are unsaturated. In 
animais, cholesterol content also regulates membrane fluidity. 

• Eukaryotic Cells Contain Compartments Bounded by Internai Membranes 

An extensive array of internai membranes in eukaryotes creates compart¬ 
ments within a cell for distinct biochemical functions. For instance, a dou- 
ble membrane surrounds the nucleus, the location of most of the cells ge- 
netic material, and the mitochondria, the location of most ATP synthesis. 
A single membrane defines the other internai compartments, such as the 
endoplasmic reticulum. Some compartments can exchange material by the 
process of membrane budding and fusion. As with all membranes, the 
proteins associated with these membranes determine the specific bio¬ 
chemical function. Specific amino acid sequences in the proteins direct 
these molecules to the appropriate compartment. 
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-1 KEY TERMS 

fatty acid (p. 320) 
phospholipid (p. 322) 
sphingosine (p. 322) 
phosphoglyceride (p. 322) 
sphingomyelin (p. 323) 
glycolipid (p. 324) 
cerebroside (p. 325) 


ganglioside (p. 325) 
cholesterol (p. 325) 
amphipathic molecule (p. 325) 
lipid bilayer (p. 326) 
liposome (p. 327) 

integral membrane protein (p. 329) 
peripheral membrane protein (p. 329) 


hydropathy plot (p. 335) 
lateral diffusion (p. 335) 
fluid mosaic model (p. 336) 
targeting sequence (p. 340) 
receptor-mediated endocytosis (p. 341) 


- ^SELECTED READINGSj - 

Where to start 

De Weer, R, 2000. A century of thinking about cell membranes. Annu. 
Rev. Physiol. 62:919-926. 

Bretscher, M. S., 1985. The molecules of the cell membrane. Sei. Am. 
253(4):100-108. 

Unwin, N., and Henderson, R., 1984. The strueture of proteins in bi- 
ological membranes. Sei. Am. 250(2):78-94. 

Deisenhofer, J., and Michel, H., 1989. The photosynthetic reaction cen¬ 
tre from the purple bacterium Rhodopseudomonas viridis. EMBO J. 
8:2149-2170. 

Singer, S. J., and Nicolson, G. L., 1972. The fluid mosaic model of the 
strueture of cell membranes. Seienee 175:720-731. 

Jacobson, K., Sheets, E. D., and Simson, R., 1995. Revisiting the fluid 
mosaic model of membranes. Seienee 268:1441-1442. 

Books 

Gennis, R. B., 1989. Biomembranes: Moleeular Strueture and Funetion. 
Springer Verlag. 

Vance, D. E., and Vance, J. E. (Eds.)., 1996. Bioehemistry of Lipids, 
Lipoproteins, and Membranes. Elsevier. 

Lipowsky, R., and Sackmann, E., 1995. The strueture and dynamies of 
membranes. Elsevier. 

Racker, E., 1985. Reeonstitutions of Transporters, Reeeptors, and Patho- 
logieal States. Academic Press. 

Tanford, C., 1980. The Hydrophobie Effeet: Formation of Mieelles and 
Biologieal Membranes (2d ed.). Wiley-Interscience. 

Membrane lipids and dynamies 

Saxton, M. J., and Jacobson, K., 1997. Single-particletracking: Applica¬ 
tions to membrane dynamies. Annu. Rev. Biophys. Biomol. Struet. 
26:373-399. 

Bloom, M., Evans, E., Mouritsen, O. G., 1991. Physical properties of 
the fluid lipid-bilayer component of cell membranes: A perspec¬ 
tive. Q. Rev. Biophys. 24:293-397. 

Elson, E. L., 1986. Membrane dynamies studied by fluorescence cor- 
relation spectroscopy and photobleaching recovery. Soe. Gen. Phys¬ 
iol. Ser. 40:367-383. 

Zachowski, A., and Devaux, P. K, 1990. Transmembrane movements 
of lipids. Experientia 46:644-656. 

Devaux, P. K, 1992. Protein involvement in transmembrane lipid asym- 
metry. Annu. Rev. Biophys. Biomol. Struet. 21:417-439. 

Silvius, J. R., 1992. Solubilization and functional reconstitution of bio- 
membranecomponents. Annu. Rev. Biophys. Biomol. Struet. 21:323-348. 

Yeagle, P. L., Albert, A. D., Boesze-Battaglia, K., Young, J., and Erye, 
J., 1990. Cholesterol dynamies in membranes. Biophys. f 57:413M24. 

Nagle, J. K, and Tristram-Nagle, S., 2000. Lipid bilayer strueture. Curr. 
Opin. Struet. Biol. 10: 474-480. 


Dowhan, W., 1997. Molecular basis for membrane phospholipid diver- 
sity: Why are there so many lipids? Annu. Rev. Bioehem. 66: 
199-232. 

Huijbregts, R. P. H., de Kroon, A. I. P. M., and de Kruijff, B., 1998. 
Rapid transmembrane movement of newly synthesized phos- 
phatidylethanolamine across the inner membrane of Eseheriehia 
eoli.f Biol.Chem. 273:18936-18942. 

Strueture of membrane proteins 

Popot, J.-L., and Engleman, D. M., 2000. Helical membrane protein 
folding, stability and evolution. Annu. Rev. Bioehem. 69:881-922. 

White, S. H., and Wimley, W. C., 1999. Membrane protein folding and 
stability: Physical principies. Annu. Rev. Biophys. Biomol. Struet. 
28:319-365. 

Marassi, E. M., and Opella, S. J., 1998. NMR structural studies of mem¬ 
brane proteins. Curr. Opin. Struet. Biol. 8:640-648. 

Lipowsky, R., 1991. The conformation of membranes. Nature 349: 
475-481. 

Altenbach, C., Marti, T., Khorana, H. G., and Hubbell, W. L., 1990. 
Transmembrane protein strueture: Spin labeling of bacterio- 
rhodopsin mutants. Seienee 248:1088-1092. 

Easman, G. D., and Gilbert, W. A., 1990. The prediction of trans¬ 
membrane protein sequences and their conformation: An evalua- 
tion. Trends Bioehem. Sei. 15:89-92. 

Jennings, M. L., 1989. Topography of membrane proteins. Annu. Rev. 
Bioehem. 58:999-1027. 

Engelman, D. M., Steitz, T. A., and Goldman, A., 1986. Identifying 
non-polar transbilayer helices in amino acid sequences of mem¬ 
brane proteins. Annu. Rev. Biophys. Biophys. Chem. 15:321-353. 

Undenfriend, S., and Kodukola, K., 1995. How glycosyl-phosphatidyl- 
inositiol-anchored membrane proteins are made. Annu. Rev. 
Bioehem. 64:563-591. 

Intracellular membranes 

Skehel, J. J., and Wiley, D. C., 2000. Receptor binding and membrane 
fusion in virus entry: The influenza hemagglutinin. Annu. Rev. 
Bioehem. 69:531-569. 

Roth, M. G., 1999. Lipid regulators of membrane traffic through the 
Golgi complex. Trends Cell Biol. 9:174-179. 

Jahn, R., and Sudhof, T. C., 1999. Membrane fusion and exocytosis. 
Annu. Rev. Bioehem. 68:863-911. 

Stroud, R. M., and Walter, R, 1999. Signal sequence recognition and 
protein targeting. Curr. Opin. Struet. Biol. 9:754-759. 

Teter, S. A., and Klionsky, D. J., 1999. How to get a folded protein 
across a membrane. Trends Cell Biol. 9:428-31. 

Hettema, E. H., Distei, B., and Tabak, H. K, 1999. Import of proteins 
into peroxisomes. Bioehim. Biophys. Aeta 1451:17-34. 


PROBLEMS 


1. Population density. How many phospholipid molecules are 
there in a l-|JLm^ region of a phospholipid bilayer membrane? 
Assume that a phospholipid molecule oceupies 70 A^ of the sur- 
face area. 


2. Lipid diffusion. What is the average distance traversed by a 
membrane lipid in 1 ps, 1 ms, and 1 s? Assume a diffusion co- 
efficient of 10~^ cm^s~^. 
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3. Protein diffusion. The diffusion coefficient, D, of a rigid 
spherical molecule is given by 

D = kP/ÒTTiqr 

in which T\ is the viscosity of the solvent, r is the radius of the 
sphere, k is the Boltzman constant (1.38 X erg/degree), 

and T is the absolute temperature. What is the diffusion coeffi¬ 
cient at 37°C of a 100-kd protein in a membrane that has an ef- 
fective viscosity of 1 poise (1 poise = 1 erg s~^/cm~^)? What is 
the average distance traversed by this protein in 1 ps, 1 ms, and 
1 s? Assume that this protein is an unhydrated, rigid sphere of 
density 1.35 g cm~^. 

4. Cold sensitivity. Some antibiotics act as carriers that bind an 
ion on one side of a membrane, diffuse through the membrane, 
and release the ion on the other side. The conductance of a lipid- 
bilayer membrane containing a carrier antibiotic decreased abrupt- 
ly when the temper ature was lowered from 40°Cto36°C. In con- 
trast, there was little change in conductance of the same bilayer 
membrane when it contained a channel-forming antibiotic. Why? 

5. Flip-flop. The transverse diffusion of phospholipids in a bi¬ 
layer membrane was investigated by using a paramagnetic ana- 
logofphosphatidyl choline, called spin-labeledphosphatidyl choline. 


H3C(H2C)i4 


H3C(H2C)i4 



Spin-labeled phosphatidyl choline 


The nitroxide (NO) group in spin-labeled phosphatidyl choline 
gives a distinctive paramagnetic resonance spectrum. This spec- 
trum disappears when nitroxides are converted into amines by 
reducing agents such as ascorbate. 

Lipid vesicles containing phosphatidyl choline (95%) and 
the spin-labeled analog (5%) were prepared by sonication and 
purified by gel-filtration chromatography. The outside diame- 
ter of these liposomes was about 25 nm. The amplitude of the 
paramagnetic resonance spectrum decreased to 35% of its initial 
value within a few minutes of the addition of ascorbate. There 
was no detectable change in the spectrum within a few minutes 
after the addition of a second aliquot of ascorbate. However, the 
amplitude of the residual spectrum decayed exponentially with 
a half-time of 6.5 hours. How would you interpret these changes 
in the amplitude of the paramagnetic spectrum? 

6. Flip-flop 2. Although proteins rarely if ever flip-flop across 
a membrane, distribution of membrane lipids between the mem¬ 
brane leaflets is not absolute except in the case of glycolipids. 
Why are glycosylated lipids less likely to flip-flop? 

7. Cis versus trans. Why are most unsaturated fatty acids found 
in phospholipids in the cis rather than the trans conformation? 
Draw the structure of a 16-carbon fatty acid as saturated, trans 
monounsaturated, and cis monounsaturated. 

8. A question of competition. Would a homopolymer of alanine 
be more likely to form an a helix in water or in a hydrophobic 
médium? Explain. 


9. Maintaining fluidity. A culture of bactéria growing at 37°C 
was shifted to 25°C. How would you expect this shift to alter the 
fatty acid composition of the membrane phospholipids? Explain. 

Data Interpretation Problems 

10. Cholesterol effects. The red line on the foliowing graph shows 
the fluidity of the fatty acids of a phospholipid bilayer as a func- 
tion of temperature. The blue line shows the fluidity in the pres- 
ence of cholesterol. 



Temperature —^ 

(a) What is the effect of cholesterol? 

(b) Why might this effect be biologically important? 

11. Hydropathy plots. On the basis of the folio wing hydropathy 
plots for three proteins, predict which would be membrane pro¬ 
teins. What are the ambiguities with respect to using such plots 
to determine if a protein is a membrane protein? 





Chapter Integration Problem 

12. The proper environment. An understanding of the structure 
and function of membrane proteins has lagged behind that of other 
proteins. The primary reason is that membrane proteins are more 
difficult to purify and crystallize. Why might this be the case? 















Membrane Channels and Pumps 



The flow of íons through a single membrane channel (channels are shown in red 
ín the íllustration at the left) can be detected by the patch clamp techníque, 
whích records current changes as the channel transits between the open and 
closed States. [(Left) After E. Neher and B. Sakmann. The patch clamp techníque. Copyright © 1992 
by Scientific American, Inc. All rights reserved. (Right) Courtesy of Dr. Maurício Montai.] 


OUTLINE 


The lipid bilayer of biological membranes, as discussed in Chapter 12, is 
intrinsically impermeable to ions and polar molecules. Permeability is con- 
ferred by two classes of membrane proteins, pumps and channels. Pumps use 
a source of free energy such as ATP or light to drive the 
thermodynamically uphill transport of ions or molecules. 

Pump action is an example of active transport. Channels, in 
contrast, enable ions to flow rapidly through membranes in 
a downhill direction. Channel action illustrates passive 
transport, or facilitated diffusion. 

Pumps are energy transducers in that they convert one 
form of free energy into another. Two types of ATP-driven 
pumps, P-type ATPases and the ATP-binding cassette 
pumps, undergo conformational changes on ATP binding 
and hydrolysis that cause a bound ion to be transported 
across the membrane. Phosphorylation and dephosphory- 
lation of both the Ca^^-ATPase and the Na^-K^-ATPase 
pumps, which are representative of P-type ATPase, are 
coupled to changes in orientation and affinity of their ion- 
binding sites. 

A different mechanism of active transport, one that uti¬ 
lizes the gradient of one ion to drive the active transport of 
another, will be illustrated by the sodium-calcium ex- 
changer. This pump plays an important role in extruding 
Ca^^ from cells. 

We begin our examination of channels with the acetyl- 
choline receptor, a channel that mediates the transmission 
of nerve signals across synapses, the functional junctions 
between neurons. The acetylcholine receptor is a ligand- 
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FIGURE 13.1 Acetylcholíne receptors. 

An electron micrograph shows the densely 
packed acetylcholíne receptors embedded 
in a postsynaptic membrane. [Courtesy of 
Dr. John Heuser and Dr. Shelly Salpeter.] 


gated channel in that the channel opens in response to the binding of acetyl- 
choline (Figure 13.1). In contrast, the sodium and potassium channels, 
which mediate action potentials in neuron axon membranes, are opened by 
membrane depolarization rather than by the binding of an allosteric effec- 
tor. These channels are voltage-gated. These channels are also of interest 
because they swiftly and deftly distinguish between quite similar ions (e.g., 
Na^ and K^). The flow of ions through a single channel in a membrane 
can readily be detected by using the patch-clamp technique. 

The chapter concludes with a view of a different kind of channel—the 
cell-to-cell channel, or gap junction. These channels allow the transport of 
ions and metabolites between cells. 


13.1 THE TRANSPORT OF MOLECULES 

ACROSS A MEMBRANE MAY BE ACTIVE OR PASSIVE 

Before we consider the specifics of membrane-protein function, we will 
consider some general principies of membrane transport. Two factors 
determine whether a molecule will cross a membrane: (1) the permeabil- 
ity of the molecule in a lipid bilayer and (2) the availability of an energy 
source. 

13.1.1 Many Molecules Require Protein Transporters 
to Cross Membranes 

As discussed in Chapter 12, some molecules can pass through cell mem¬ 
branes because they dissolve in the lipid bilayer. Such molecules are called 
lipophilic molecules. The steroid hormones provide a physiological example. 
These cholesterol relatives can pass through a membrane in their path of 
movement, but what determines the direction in which they will move? Such 
molecules will pass through a membrane located down their concentration 
gradient in a process called simple diffusion. In accord with the Second Law 
of Thermodynamics, molecules spontaneously move from a region of higher 
concentration to one of lower concentration. Thus, in this case, an entropy 
increase powers transport across the membrane. 

Matters become more complicated when the molecule is highly polar. 
For example, sodium ions are present at 143 mM outside the cell and 
14 mM inside the cell, yet sodium does not freely enter the cell because the 
positively charged ion cannot pass through the hydrophobic membrane in¬ 
terior. In some circumstances, as during a nerve impulse (Section 13.5.3), 
sodium ions must enter the cell. How are they able to do so? Sodium ions 
pass through specific channels in the hydrophobic barrier formed by mem¬ 
brane proteins. This means of Crossing the membrane is called facilitated 
diffusion, because the diffusion across the membrane is facilitated by the 
channel. It is also called passive transport, because the energy driving the 
ion movement originates from the ion gradient itself, without any contri- 
bution by the transport system. Channels, like enzymes, display substrate 
specificity. 

How is the sodium gradient established in the first place? In this case, 
sodium must move, or be pumped, against a concentration gradient. Be¬ 
cause moving the ion from a low concentration to a higher concentration re- 
sults in a decrease in entropy, it requires an input of free energy. Protein 
transporters embedded in the membrane are capable of using an energy 
source to move the molecule up a concentration gradient. Because an input 
of energy from another source is required, this means of Crossing the mem¬ 
brane is called active transport. 



13.1.2 Free Energy Stored in Concentration Gradients 
Can Be Quantified 

An unequal distribution of molecules is an energy-rich condition because 
free energy is minimized when all concentrations are equal. Consequently, 
to attain such an unequal distribution of molecules, called a concentration 
gradient, requires an input of free energy. In fact, the creation of a concen¬ 
tration gradient is the result of active transport. Can we quantify the amount 
of energy required to generate a concentration gradient (Figure 13.2)? Con- 
sider an uncharged solute molecule. The free-energy change in transport- 
ing this species from side 1, where it is present at a concentration of c ^, to 
side 2, where it is present at concentration C 2 , is 

AG = RTln{c2/ci) = 2.303RT logio(c2/ci) 

For a charged species, the unequal distribution across the membrane generates 
an electrical potential that also must be considered because the ions will be re- 
pelled by the like charges. The sum of the concentration and electrical terms is 
called the electrochemical potential. The free-energy change is then given by 

AG = RTln{c2/c,) + ZFAV = 2.303RTlogio(c2/ci) + ZFAV 

in which Z is the electrical charge of the transported species, A V is the po¬ 
tential in volts across the membrane, and F is the faraday [23.1 kcal 
mor^(96.5kJV“^mor^)]. 

A transport process must be active when AG is positive, whereas it can 
be passive when AG is negative. For example, consider the transport of an 
uncharged molecule from c^ = 10 ~^Mtoc 2 = 10~^M. 

AG = 2.303FTlogio(10“VlO“^) 

= 2.303 X 1.99 X 298 X 2 
= +2.7 kcal moF^ ( +11.3 kj moF^) 

At 25°C (298 K), AG is + 2.7 kcal moF^ ( + 11.3 kj moF^), indicating 
that this transport process requires an input of free energy. It could be 
driven, for example, by the hydrolysis of ATP, which yields —12 kcal 
moF^ ( — 50.2 kJ moF^) under typical cellular conditions. If AG is neg¬ 
ative, the transport process can occur spontaneously without free-energy 
input. 

lon gradients are important energy storage forms in all biological Sys¬ 
tems. For instance, bacteriorhodopsin (Section 12.4.2) generates a proton 
gradient at the expense of light energy, an example of active transport. The 
energy of the proton gradient in turn can be converted into chemical energy 
in the form of ATP. This example illustrates the use of membranes and 
membrane proteins to transform energy: from light energy into an ion gra¬ 
dient into chemical energy. 


13.2 A FAMILY OF MEMBRANE PROTEINS USES ATP 
HYDROLYSIS TO PUMP IONS ACROSS MEMBRANES 

The extracellular fluid of animal cells has a salt concentration similar to that 
of sea water. However, cells must control their intracellular salt concentra¬ 
tions to prevent unfavorable interactions with high concentrations of ions 
such as calcium and to facilitate specific processes. For instance, most ani¬ 
mal cells contain a high concentration of and a low concentration of Na^ 
relative to the externai médium. These ionic gradients are generated by a 
specific transport system, an enzyme that is called the Na^-K'^ pump or the 
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FIGURE 13.2 Free energy and transport. 

The free-energy change in transporting (A) 
an uncharged solute from a compartment 
at concentration C] to one at C 2 and (B) a 
singly charged species across a membrane 
to the side having the same charge as that 
of the transported ion. Note that the free- 
energy change imposed by a membrane 
potential of 59 mV is equivalent to that 
imposed by a concentration ratio of 10 for 
a singly charged ion at 25°C. 
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FIGURE 13.3 Phosphoaspartate. 

Phosphoaspartate (aiso referred to as p- 
aspartyl phosphate) is a key intermediate 
in the reaction cycles of P-type ATPases. 
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FIGURE 13.4 Structure of SR 
Ca^"^ ATPase. This enzyme, the calcium 
pump of the sarcoplasmic reticulum, 
comprises a membrane-spanning domain 
of 10 a helices and a cytoplasmic headpiece 
consisting of three domains (N, P, and A). 
Two calcium ions (green) bind within the 
membrane-spanning region. The aspartate 
residue characteristic of this protein family 
is indicated. 


Na^-K^ ATPase. The hydrolysis of ATP by the pump provides the energy 
needed for the active transport of Na^ out of the cell and into the cell, 
generating the gradient. The pump is called the Na^-K^ ATPase because 
the hydrolysis of ATP occurs only when Na^ and are bound to the 
pump. Moreover, this ATPase, like all such enzymes, requires Mg^^ (Sec- 
tion 9.4.2). The active transport of Na^ and is of great physiological sig- 
nificance. Indeed, more than a third of the ATP consumed by a resting an¬ 
imal is used to pump these ions. The Na^-K^ gradient in animal cells 
Controls cell volume, renders neurons and muscle cells electrically excitable, 
and drives the active transport of sugars and amino acids. 

Y The subsequent purification of other ion pumps has revealed a large 
family of evolutionarily related ion pumps including proteins from 
bactéria, archaea, and all eukaryotes. These pumps are specific for an array 
of ions. Of particular interest are the Ca^^ ATPase, the enzyme that trans- 
ports Ca^^ out of the cytoplasm and into the sarcoplasmic reticulum of mus¬ 
cle cells, and the gastric - K^ ATPase, the enzyme responsible for pump- 

ing sufficient protons into the stomach to lower the pH below 1.0. These 
enzymes and the hundreds of known homologs, including the Na^-K^ 
ATPase, are referred to as P-type ATPases because they form a key phos- 
phorylated intermediate. In the formation of this intermediate, a phospho- 
ryl group obtained from the hydrolysis of ATP is linked to the side chain 
of a specific conserved aspartate residue in the ATPase (Figure 13.3). 

13.2.1 The Sarcoplasmic Reticulum Ca^^ ATPase Is an 
Integral Membrane Protein 

We will consider the structural and mechanistic features of these enzymes 
by examining the Ca^^ ATPase found in the sarcoplasmic reticulum (SR 
Ca^^ ATPase) of muscle cells. This enzyme, which constitutes 80% of the 
sarcoplasmic reticulum membrane protein, plays an important role in mus¬ 
cle contraction, which is triggered by an abrupt rise in the cytosolic calcium 
levei. Muscle relaxation depends on the rapid removal of Ca^^ from the 
cytosol into the sarcoplasmic reticulum, a specialized compartment for cal¬ 
cium storage, by the SR Ca^^ ATPase. This pump maintains a Ca^^ con- 
centration of approximately 0.1 |ulM in the cytosol compared with 1.5 mM 
in the sarcoplasmic reticulum. 

The SR Ca^^ ATPase is a single 110-kd polypeptide with a transmem- 
brane domain consisting of 10 a helices. A large cytoplasmic head piece 
constitutes nearly half the molecular weight of the protein and consists of 
three distinct domains (Figure 13.4). The three cytoplasmic domains of the 
SR Ca^^ ATPase have distinct functions. One domain (N) binds the ATP 
nucleotide, another (P) accepts the phosphoryl group on its conserved as¬ 
partate residue, and the third (A) may serve as an actuator for the N do¬ 
main. The relation between these three domains changes significantly on 
ATP hydrolysis. The crystal structure in the absence of ATP shows the 
likely nucleotide-binding site separated by more than 25 A from the phos- 
phorylation site, suggesting that the N and P domains move toward one an¬ 
other during the catalytic cycle. This closure is facilitated by ATP binding 
and by the binding of Ca^^ to the membrane-spanning helices. 

The results of mechanistic studies of the SR Ca^^ ATPase and other 
P-type ATPases have revealed two common features. First, as we have seen, 
each protein can be phosphorylated on a specific aspartate residue. For the SR 
Ca^^ ATPase, this reaction takes place at Asp 351 only in the presence of rel- 
atively high cytosolic concentrations of Ca^^. Second, each pump can inter- 
convert between at least two different conformations, denoted by Ei and E 2 . 
Thus, at least four conformational States—Ei, Ei-P, E 2 -P, and E 2 —participate 
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FIGURE 13.5 Mechanism of P-type 
ATPase action. The binding of 
and the phosphorylation of the ATPase 
(steps 1 and 2), illustrated here for the 
Ca^^ ATPase, lead to the eversion of the 
binding sites (step 3) and the release of 
Ca^^ to the luminal side of the membrane 
(step 4). Hydrolysis of phosphoaspartate 
(step 5) and eversion (step 6) reset the 
enzyme to its initial State. 


in the transport process. From these four States, it is possible to construct a 
plausible mechanism of action for these enzymes, although further studies are 
necessary to confirm the mechanism and provide more details (Figure 13.5): 

1. The postulated reaction cycle begins with the binding of ATP and two 
Ca^^ ions to the Ei state. 

2. The enzyme cleaves ATP, transferring the y-phosphoryl group to the 
key aspartate residue. Calcium must be bound to the enzyme for the phos¬ 
phorylation to take place. Phosphorylation shifts the conformational equi- 
librium of the ATPase toward E 2 . 

3. The transition from the Ei to the E 2 state causes the ion-binding sites to 
‘‘evert’' so that the ions can dissociate only to the luminal side of the membrane. 

4. In the E 2 -P state, the enzyme has low affinity for the Ca^^ ions, so they 
are released. 

5. With the release of Ca^^, the phosphoaspartate residue is hydrolyzed, 
and the phosphate group is released. 

6. The enzyme, devoid of a covalently attached phosphoryl group, is not 
stable in the E 2 form. It everts back to the Ei form, completing the cycle. 

Essentially the same mechanism is employed by the Na^-K^ ATPase. 
The El State binds three Na^ ions and transports them across the mem¬ 
brane and out of the cell as a result of the protein’s phosphorylation and 
transition to the E 2 state. The three Na^ ions are released into the extra- 
cellular médium. The E 2 state of this enzyme also binds ions—namely, two 
ions. These ions are carried across the membrane in the opposite di- 
rection by eversion driven by the hydrolysis of the phosphoaspartate residue 
and are released into the cytosol. 

The change in free energy accompanying the transport of Na^ and 
can be calculated (Section 13.1.1). Suppose that the concentration of Na^ 
outside and inside the cell is 143 and 14 mM, respectively, and that of 
is 4 and 157 mM. At a membrane potential of — 50 mV and a temper- 
ature of 37 °C, the free-energy change in transporting 3 moles of Na^ out 
of and 2 moles of into the cell is +10.0 kcal ( + 41.8 kj moP^). The hy¬ 
drolysis of a single ATP per transport cycle provides sufficient free energy, 
about —12 kcal moP^ ( — 50 kJ mol~^) under cellular conditions, to drive 
the uphill transport of these ions. 









































Flippase 



FIGURE 13.6 P-type ATPases can 
transport lípids. Flippases are enzymes 
that maintain membrane asymmetry by 
"flipping" phospholipids (displayed with a 
red head group) from the outer to the 
inner layer of the membrane. 



Foxglove (Digitalis purpured) is the source 
of digitalis, one of the most widely used 
drugs. [Inga Spence/Visuais Uniimited.] 


13.2.2 P-Type ATPases Are Evolutionarily Conserved and Play a 
Wide Range of Roles 

Y Analysis of the complete yeast genome revealed the presence of 16 
proteins that clearly belong to the P-type ATPase family. More de- 
tailed sequence analysis suggests that 2 of these proteins transport ions, 
2 transport Ca^^, 3 transport Na^, and 2 transport metais such as Cu^^. 
In addition, 5 members of this family appear to participate in the transport 
of phospholipids with amino acid head groups. These latter proteins assist 
in the maintenance of membrane asymmetry by transporting lipids such as 
phosphatidyl serine from the outer to the inner leaflet of the bilayer mem¬ 
brane (Figure 13.6). Such enzymes have been termed ‘Tlippases.'' 

All members of this protein family employ the same fundamental mech- 
anism. The free energy of ATP hydrolysis drives membrane transport by 
effecting conformational changes associated with the addition and removal 
of a phosphoryl group at an analogous aspartate site in each protein. 

13.2.3 Digitalis Specifically Inhibits the Na^-K^ Pump 
by Blocking Its Dephosphorylation 

Certain steroids derived from plants are potent inhibitors (Ki — 10 nM) of 
the Na^-K^ pump. Digitoxigenin and ouabain are members of this class of 
inhibitors, which are known as cardiotonic steroids because of their strong 
effects on the heart (Figure 13.7). These compounds inhibit the dephos¬ 
phorylation of the E 2 -P form of the ATPase when applied on the extracell- 
ular face of the membrane. 
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FIGURE 13.7 Digitoxigenin. Cardiotonic 
steroids such as digitoxigenin inhibit the Na^ 
pump by blocking the dephosphorylation 
of E 2 -P. 


T Digitalis, a mixture of cardiotonic steroids derived from the dried 
leaf of the foxglove plant (Digitalis purpurea), is of great clinicai sig- 
nificance. Digitalis increases the force of contraction of heart muscle, which 
makes it a choice drug in the treatment of congestive heart failure. Inhibi- 
tion of the Na^-K^ pump by digitalis leads to a higher levei of Na^ inside 
the cell. The diminished Na^ gradient results in slower extrusion of Ca^^ 
by the sodium-calcium exchanger (Section 13.4). The subsequent increase 
in the intracellular levei of Ca^^ enhances the contractility of cardiac mus¬ 
cle. It is interesting to note that digitalis was effectively used long before the 
discovery of the Na^-K^ ATPase. In 1785, William Withering, a British 
physician, heard tales of an elderly woman, known as ‘‘the old woman of 
Shropshire,’' who cured people of “dropsy'' (which today would be recog- 
nized as congestive heart failure) with an extract of foxglove. Withering con- 
ducted the first scientific study of the effects of foxglove on congestive heart 
failure and documented its effectiveness. 







13.3 MULTIDRUG RESISTANCE AND CYSTIC FIBROSIS 
HIGHLIGHT A FAMILY OF MEMBRANE PROTEINS 
WITH ATP-BINDING CASSETTE DOMAINS 


ATP-Binding Cassettes 


Tumor cells in culture often become resistant to drugs that were 
^ initially quite toxic to the cells. Remarkably, the development of re- 
sistance to one drug also makes the cells less sensitive to a range of other 
compounds. This phenomenon is known as multidrug resistance. In a sig- 
nificant discovery, the onset of multidrug resistance was found to correlate 
with the expression and activity of a membrane protein with an apparent 
molecular mass of 170 kd. This protein acts as an ATP-dependent pump 
that extrudes a wide range of small molecules from cells that express it. 
The protein is called the multidrug resistance protein (MDR) or P-glyco- 
protein (‘‘glyco'' because it includes a carbohydrate moiety). Thus, when 
cells are exposed to a drug, the MDR pumps the drug out of the cell be- 
fore the drug can exert its effects. A related protein was discovered through 
genetic studies of the hereditary disease cystic fibrosis (Section 1.1.4). In one 
of the first studies leading to the identification of a specific genetic change 
causing human disease, investigators performed a comparative genetic 
analysis of many people having this disease and family members who did 
not have the disease. The gene found to be mutated in the affected per sons 
encodes a protein, now called cystic fibrosis transmembrane conductance reg- 
ulator (CFTR). CFTR acts as an ATP-regulated chloride channel in the 
plasma membranes of epithelial cells. As mentioned in Chapter 1, cystic 
fibrosis results from a decrease in fluid and salt secretion by CFTR. As a 
consequence of this defect, secretion from the pancreas is blocked and 
heavy, dehydrated mucus accumulates in the lungs, leading to chronic lung 
infections. 

r Analysis of the amino acid sequences of MDR, CFTR, and homo- 
logous proteins revealed a common architecture (Figure 13.8). Each 
protein comprises four domains: two membrane-binding domains of un- 
known structure and two ATP-binding domains. The ATP-binding do¬ 
mains of these proteins are called ATP-binding cassettes (or ABCs) and are 
homologous to domains in a large family of transport proteins of bactéria 
and archaea. Indeed, with 79 members, the ABC proteins are the largest 
single family identified in the E. coli genome. The ABC proteins are mem¬ 
bers of the P-loop NTPase superfamily (Section 9.4.4). Some ABC proteins, 
particularly those of prokaryotes, are multisubunit proteins constructed 
such that the membrane-spanning domains and the ABC domains are pres- 
ent on separate polypeptide chains. The consolidation of enzymatic activi- 
ties of several polypeptide chains in prokaryotes to a single chain in 
eukaryotes is a theme that we will see again (Section 22.4.4). For example, 
the histidine permease of Salmonella typhimurium, which transports the 
amino acid histidine into the bacterium, consists of (1) two different pro¬ 
tein subunits with membrane-spanning domains (HisQand HisM) and (2) 
a homodimeric protein (HisP) with ABC domains (Figure 13.9). A soluble, 
histidine-binding protein (HisJ) binds histidine after the amino acid enters 
the cell. 

Like other members of the P-loop NTPase superfamily, proteins with 
ABC domains undergo conformational changes on ATP binding and hy- 
drolysis. These structural changes are coupled within each dimeric trans- 
porter unit in a manner that allows these membrane proteins to drive the 
uptake or efflux of specific compounds or to act as gates for open membrane 
channels. 
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FIGURE 13.8 ABC transporters. The 

nnultidrug resistance protein (MDR) and 
the cystic fibrosis transmembrane regulator 
(CFTR) are homologous proteins 
composed of two transmembrane 
domains and two ATP-binding domains, 
called ATP-binding cassettes (ABCs). 



FIGURE 13.9 Histidine permease. In 

the histidine permease of S. typhimurium, 
the membrane-spanning regions (yellow 
and orange) and ABC regions (blue) are 
on separate polypeptide chains (compare 
with Figure 13.8). ATP hydrolysis drives the 
transport of histidine into the cell. 
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FIGURE 13.10 Secondary transporters. 

These transporters employ the downhili 
flow of one gradient to power the 
formation of another gradient. In 
antiporters, the Chemical species move 
in opposite directions. In symporters, the 
two species move in the same direction. 


FIGURE 13.11 Action of lactose 
permease. Lactose permease pumps 
lactose into bacterial cells by drawing on 
the proton-motive force. The binding sites 
evert when a lactose molecule (L) and a 
proton (H^) are bound to externai sites. 
After these species are released inside the 
cell, the binding sites again evert to 
complete the transport cycle. Lactose 
permease is an example of a symporter. 


13.4 SECONDARY TRANSPORTERS USE ONE 
CONCENTRATION GRADIENT TO POWER THE 
FORMATION OF ANOTHER 

Many active-transport processes are not directly driven by the hydrolysis 
of ATP. Instead, the thermodynamically uphill flow of one species of ion 
or molecule is coupled to the downhili flow of a different species. Mem¬ 
brane proteins that pump ions or molecules uphill by this means are termed 
secondary transporters or cotransporters. These proteins can be classified as 
either antiporters or symporters. Antiporters couple the downhili flow of one 
species to the uphill flow of another in the opposite direction across the mem¬ 
brane; symporters use the flow of one species to drive the flow of a differ¬ 
ent species in the same direction across the membrane (Figure 13.10). 

The sodium—calcium exchanger in the plasma membrane of an animal cell 
is an antiporter that uses the electrochemical gradient of Na^ to pump Ca^^ 
out of the cell. Three Na^ ions enter the cell for each Ca^^ ion that is ex- 
truded. The cost of transport by this exchanger is paid by the Na^-K^- 
ATPase pump, which generates the requisite sodium gradient. Because 
Ca^^ is a vital messenger inside the cell, its concentration must be tightly 
controlled. The exchanger has lower affinity for Ca^~^ than does the 
Ca^^ATPase (Section 13.2.1), but its capacity to extrude Caf^ is greater. 
The exchanger can lower the cytosolic Ca^^ levei to several micromolar; 
submicromolar Ca^^ leveis are attained by the subsequent action of the 
Ca^^ ATPase. The exchanger can extrude about 2000 Ca^^ ions per sec- 
ond, compared with only 30 ions per second for the Ca^^-ATPase pump. 

Glucose is pumped into some animal cells by a symporter powered by 
the simultaneous entry of Na^. The entry of Na^ provides a free-energy 
input of 2.2 kcal moP^ (9.2 kj moP^) under typicai cellular conditions (ex¬ 
ternai [Na^] = 143 mM, internai [Na^] = 14 mM, and membrane poten- 
tial = — 50 mV). This free-energy input is sufficient to generate a 66-fold 
concentration gradient of an uncharged molecule such as glucose. 

Y Secondary transporters are ancient molecular machines, common 
today in bactéria and archaea as well as in eukaryotes. For example, 
approximately 160 (of approximately 4000) proteins encoded by the E. coli 
genome appear to be secondary transporters. Sequence comparison and hy- 
dropathy analysis suggest that members of the largest family have 12 trans- 
membrane helices that appear to have arisen by duplication and fusion of a 
membrane protein with 6 transmembrane helices. Included in this family 
is the lactose permease of E. coli. This symporter uses the gradient across 

the E. coli membrane generated by the oxidation of fuel molecules to drive 
the uptake of lactose and other sugars against a concentration gradient. The 
permease has a proton-binding site and a lactose-binding site (Figure 13.11). 
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FIGURE 13.12 Energy transduction by 
membrane proteíns. The Na^-K^ ATPase 
converts the free energy of phosphoryl 
transfer into the free energy of a Na^ ion 
gradient. The ion gradient can then be used 
to pump materiais into the cell, through 
the action of a secondary transporter such 
as the Na^-glucose symporter. 


A proton and a lactose molecule bind to sites facing the outside of the cell. 
The permease, with both binding sites full, everts, releasing first the pro¬ 
ton and then the lactose inside the bacterium. Another eversion places the 
empty sites on the outside. Thus, the energetically uphill transfer of one lac¬ 
tose molecule is coupled to the downhill transport of one proton. Further 
analysis of the three-dimensional structures is underway and should pro- 
vide more information about their mechanisms of action as well as the evo- 
lutionary relationships within this large group of ancient proteins. 

These observations reveal how different energy currencies are intercon- 
verted. A single energy currency, ATP, is used by P-type ATPases to gen- 
erate gradients of a small number of types of ions, particularly Na^ and H^, 
across membranes. These gradients then serve as an energy source for the 
large number of secondary transporters, which allow many different mole- 
cules to be taken up or transported out of cells (Figure 13.12). 


13.5 SPECIFIC CHANNELS CAN RAPIDLY TRANSPORT 
IONS ACROSS MEMBRANES 


Pumps and secondary transporters can transport ions at rates approaching 
several thousand ions per second. Other membrane proteins, ion channels, 
which are passive transport systems, are capable of ion-transport rates that 
are more than 1000 times as high. These rates of transport through ion chan¬ 
nels are dose to rates expected for ions diffusing freely through aqueous so- 
lution. Yet, ion channels are not simply tubes that span membranes through 
which ions can rapidly flow. Instead, they are highly sophisticated molec¬ 
ular machines that respond to chemical and physical changes in their envi- 
ronments and undergo precisely timed conformational changes that facili- 
tate their roles as essential components of the nervous and other systems. 

Several key properties characterize ion channels: 

1. Ion channels can be highly selective for particular ions. For example, some 
channels allow the flow of very effectively but do not allow appreciable 
leveis of Na^ to cross the membrane. Other channels transport positively 
charged ions (cations), but block the flow of negatively charged ions (an- 
ions). The selectivities of some ion-channel proteins are shown in Table 13.1. 

2. Ion channels exist in open and closed States. These channels undergo tran- 
sitions from the closed State, incapable of supporting ion transport, to the 
open State, through which ions can flow. 

3. Transitions between the open and the closed States are regulated. Ion chan¬ 
nels are divided into two classes: ligand-gated channels and voltage-gated 
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TABLE 13.1 Relative permeabilities for selected ion channels 



Na"*" channel 

K"'' channel 

Acetylcholine 

receptor 

Chloride 

channel 

Li+ 

0.93 

< 0.01 

0.87 

< 0.01 

Na-" 

1.00 

< 0.01 

1.00 

< 0.01 

K+ 

0.09 

1.00 

1.11 
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FIGURE 13.13 Patch-clamp modes. 

The patch-clamp technique for monitoring 
channel activity is highly versatile. A high- 
resistance seal (gigaseal) is formed 
between the pipette and a small patch of 
plasma membrane. This configuration is 
called cell attached. The breaking of the 
membrane patch by increased suction 
produces a low-resistance pathway between 
the pipette and interior of the cell. The 
activity of the channels in the entire 
plasma membrane can be monitored in 
this whole-cell mode. To prepare a 
membrane in the excised-patch mode, the 
pipette is pulled away from the cell. A 
piece of plasma membrane with its 
cytosolic side now facing the médium is 
monitored by the patch pipette. 


channels. Ligand-gated channels open and dose in response to the binding 
of specific Chemicals, whereas voltage-gated channels open and dose in re¬ 
sponse to the electrical potential across the membrane in which they are 
found. 

4. Open States of channels often spontaneously convert into inactivated States. 
Most ion channels do not remain in an open state indefinitely but, instead, 
spontaneously transform into inactivated States that do not conduct ions. 
The spontaneous transitions of ion channels from open to inactivated States 
act as built-in timers that determine the duration of ion flow. 

13.5.1 Patch-Clamp Conductance Measurements Reveal the 
Activities of Single Channels 

The study of ion channels has been revolutionized by the patch-clamp tech¬ 
nique, which was introduced by Erwin Neher and Bert Sakmann in 1976 
(Figure 13.13). This powerful technique enables the measurement of the ac¬ 
tivity of a single channel to be measured. A clean glass pipette with a tip di- 
ameter of about 1 juim is pressed against an intact cell to form a seal. Slight 
suction leads to the formation of a very tight seal so that the resistance be¬ 
tween the inside of the pipette and the bathing solution is many gigaohms 
(1 gigaohm is equal to 10^ ohms). Thus, a gigaohm seal (called a gigaseal) 












ensures that an electric current flowing through the pipette is identical with 
the current flowing through the membrane covered by the pipette. The gi- 
gaseal makes possible high-resolution current measurements while a known 
voltage is applied across the membrane. In fact, patch clamping increased 
the precision of such measurements 100-fold. The flow of ions through a sin¬ 
gle channel and transitions between the open and closed States of a channel can 
be monitored with a time resolution of microseconds. Furthermore, the activ- 
ity of a channel in its native membrane environment, even in an intact cell, 
can be directly observed. Patch-clamp methods provided one of the first 
views of single biomolecules in action. Subsequently, other methods for ob- 
serving single molecules were invented, opening new vistas on biochemistry 
at its most fundamental levei. 


— I355I — 

lon Channels 


13.5.2 lon-Channel Proteins Are Built of Similar Units 

How do ion channels, vital to a wide array of biological functions, operate 
at a molecular levei? We will examine three channels important in the prop- 
agation of nerve impulses: the ligand-gated channel; the acetylcholine re¬ 
ceptor channel, which communicates the nerve impulse between certain 
neurons; and the voltage-gated Na^ and channels, which conduct the 
nerve impulse down the axon of a neuron. 

Nerve impulses are communicated across most synapses by small, dif- 
fusible molecules called neurotransmitters , of which one is acetylcholine, re- 
ferred to as a cholinergic neurotransmitter because it is derived from 
choline (Section 12.2.1). The presynaptic membrane of a synapse is sepa- 
rated from the postsynaptic membrane by a gap of about 50 nm, called the 
synaptic cleft. The end of the presynaptic axon is filled with synaptic vesi- 
cleSy each containing about lO"^ acetylcholine molecules (Figure 13.14). 
The arrival of a nerve impulse leads to the synchronous export of the con- 
tents of some 300 vesicles, which raises the acetylcholine concentration in 
the cleft from 10 nM to 500 jiiM in less than a millisecond. The binding of 
acetylcholine to the postsynaptic membrane markedly changes its ionic 
permeabilities (Figure 13.15). The conductance of both Na'^ and in- 
creases greatly within 0.1 ms, leading to a large inward current of Na^ and 
a smaller outward current of . The inward Na^ current depolarizes the 
postsynaptic membrane and triggers an action potential (Section 13.5.3). 
Acetylcholine opens a single kind of cation channel, which is almost equally 
permeable to Na'^ and . This change in ion permeability is mediated by 
the acetylcholine receptor. 

The acetylcholine receptor is the best-understood ligand-gated channel. 
The activity of a single such channel is graphically displayed in patch-clamp 



FIGURE 13.14 Schematic representatíon 
of a synapse. 
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FIGURE 13.15 Membrane depolarizatíon. Acetylcholine depolarizes the postsynaptic 
membrane by increasing the conductance of Na^ and K^. 
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FIGURE 13.16 Patch-clamp recordings of the acetylcholíne receptor channel. 

Patch-damp recordings illustrate changes in the conductance of an acetylcholíne receptor 
channel in the presence of acetylcholíne. The channel undergoes frequent transitions 
between open and closed States. [Courtesy of Dr. D. Colquhoun and Dr. B. Sakmann.] 



The torpedo (Torpedo marmorata, aiso 
known as the electric ray) has an electric 
organ, rich in acetylcholíne receptors, that 
can deliver a shock of as much as 200 V 
for approximately 1 s. [Yves Gladu/Jacana/ 
Photo Researchers.] 
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FIGURE 13.17 Schematíc representation 
of the closed form of the acetylcholíne 
receptor channel. In the closed State, 
the narrowest part of the pore is occiuded 
by side chains coming from five helices. 
[Courtesy of Dr. Nigel Unwin.] 


recordings of postsynaptic membranes of skeletal muscle (Figure 13.16). 
The addition of acetylcholine is followed by transient openings of the chan¬ 
nel. The current, i, flowing through an open channel is 4 pA (picoamperes) 
when the membrane potential, V, is —100 mV. An ampere is the flow of 
6.24 X 10^^ charges per second. Hence, 2.5 X 10^ ions per second flow 
through an open channel. 

The electric organ of Torpedo marmorata, an electric fish, is a choice 
source of acetylcholine receptors for study because its electroplaxes (volt- 
age-generating cells) are very rich in cholinergic postsynaptic membranes. 
The receptor is very densely packed in these membranes (— 20 , 000 /fim^). 
An exotic biological material has been invaluable in the isolation of acetyl¬ 
choline receptors. Snake neurotoxins such as a-bungarotoxin (from the 
venom of a Formosan snake) and cobratoxin block the transmission of im¬ 
pulses between nerve and muscle. These small (7-kd) basic proteins bind 
specifically and very tightly to acetylcholine receptors and hence can be used 
as tags. 

Y The acetylcholine receptor of the electric organ has been solubilized 
by adding a nonionic detergent to a postsynaptic membrane prepa- 
ration and purified by affinity chromatography on a column bearing cova- 
lently attached cobratoxin. With the use of techniques presented in Chap- 
ter 4, the 268-kd receptor was identified as a pentamer of four kinds of 
membrane-spanning subunits— a 2 , |3, 7 , and 8 —arranged in the form of a 
ring that creates a pore through the membrane (Figure 13.17). The cloning 
and sequencing of the cDNAs for the four kinds of subunits (50-58 kd) 
showed that they have clearly similar sequences; the genes for the a, (3, 7 , 
and 8 subunits arose by duplication and divergence of a common ancestral 
gene. Each subunit has a large extracellular domain, followed at the car- 
boxyl end by four predominantly hydrophobic segments that span the bi- 
layer membrane. Acetylcholine binds at the a -7 and a -8 interfaces. Elec¬ 
tron microscopic studies of purified acetylcholine receptors demonstrated 
that the structure has approximately fivefold symmetry, in harmony with 
the similarity of its five constituent subunits. 

What is the basis of channel opening? A comparison of the structures of 
the closed and open forms of the channel would be highly revealing, but such 
comparisons have been difficult to obtain. Cryoelectron micrographs indicate 
that the binding of acetylcholine to the extracellular domain causes a struc- 
tural alteration, which initiates rotations of the a-helical rods lining the 
membrane-spanning pore. The amino acid sequences of these helices point 
to the presence of alternating ridges of small polar or neutral residues (serine, 
threonine, glycine) and large nonpolar ones (isoleucine, leucine, phenylala- 
nine). In the closed State, the large residues may occlude the channel by form- 
ing a tight hydrophobic ring (Figure 13.18). Indeed, each subunit has a bulky 
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FIGURE 13.18 Opening of the acetylcholine channel pore. Large hydrophobic side 
chains (L) occiude the pore of the closed form of the acetylcholine receptor channel. 
Channel opening is probably mediated by the tilting of helices that line the pore. Large 
residues move away from the pore and small ones (S) take their place. [After N. Unwin. 
Neuron 3(1989):665.] 


leucine residue at a criticai position. The binding of acetylcholine could allos- 
terically rotate the membrane-spanning helices so that the pore would be lined 
by small polar residues rather than by large hydrophobic ones. The wider, 
more polar pore would then be open to the passage of Na^ and ions. 

13.5.3 Action Potentials Are Mediated by Transient 
Changes in Na^ and Permeability 

We turn now from ligand-gated channels to voltage-gated channels, which 
are responsible for the propagation of nerve impulses. A nerve impulse is an 
electrical signal produced by the flow of ions across the plasma membrane 
of a neuron and is the fundamental means of communication in the nerv- 
ous System. The interior of a neuron, like that of most other cells, has a high 
concentration of and a low concentration of Na^. These ionic gradients 
are generated by an ATP-driven pump (Section 13.2.1). In the resting 
State, the membrane potential is —60 mV. A nerve impulse, or action po- 
tential, is generated when the membrane potential is depolarized beyond 
a criticai threshold value (i.e., from —60 to —40 mV). The membrane po¬ 
tential becomes positive within about a millisecond and attains a value of 
about +30 mV before turning negative again. This amplified depolariza- 
tion is propagated along the nerve terminal (Figure 13.19) 

Ingenious experiments carried out by Alan Hodgkin and Andrew Hux- 
ley revealed that action potentials arise from large, transient changes in the 
permeability of the axon membrane to Na'^ and ions (see Figure 13.19A). 
Two kinds of voltage-sensitive channels, one selectively permeable to Na^ 
and the other to K^, were defined. The conductance of the membrane to 
Na^ changes first. Depolarization of the membrane beyond the threshold 
levei leads to an opening of Na^ channels. Sodium ions begin to flow into 
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FIGURE 13.19 Membrane potential. 

Depolarization of an axon membrane 
resuits in an action potential. Time 
course of (A) the change in membrane 
potential and (B) the change in Na' 
and conductances. 
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the cell because of the large electrochemical gradient across the plasma 
membrane. The entry of Na^ further depolarizes the membrane, and so 
more gates for Na^ are opened. This positive feedback between depolar- 
ization and Na^ entry leads to a very rapid and large change in membrane 
potential, from about —60 mV to +30 mV in a millisecond. 

Sodium channels spontaneously dose and potassium channels begin to 
open at about this time (see Figure 13.19B). Consequently, potassium ions 
flow outward, and so the membrane potential returns to a negative value. 
The resting levei of — 60 mV is restored in a few milliseconds as the con- 
ductance decreases to the value characteristic of the unstimulated State. 
Only a very small proportion of the sodium and potassium ions in a nerve 
cell, of the order of one in a million, flows across the plasma membrane dur- 
ing the action potential. Clearly, the action potential is a very efficient means 
of signaling over large distances. 



Tetrodotoxin 



A puffer fish is regarded as a culinary 
delicacy in Japan. [Fred Bavendam/ 
Peter Arnold.] 


13.5.4 The Sodium Channel Is an Example of a 
Voltage-Gated Channel 

Like the acetylcholine receptor channel, the sodium channel also was puri- 
fied on the basis of its ability to bind a specific neurotoxin. Tetrodotoxin, 
an organic compound isolated from the puffer fish, binds to sodium chan¬ 
nels with great avidity {K[ ~ 1 nM). The lethal dose of this poison for an 
adult human being is about 10 ng. The sodium channel was first purified 
from the electric organ of electric eel, which is a rich source of the protein 
forming this channel. The isolated protein is a single chain of 260 kd. 

Y The availability of purified protein enabled Shosaku Numa and 
coworkers to clone and sequence the cDNA for the sodium channel 
from the electroplax cells of the eel electric organ and then from the rat. Sub- 
sequently, a large number of sodium channel cDNAs have been cloned from 
other sources, and sequence comparisons have been made. The eel and rat 
cDNA sequences are approximately 61% identical, which indicates that the 
amino acid sequence of the sodium channel has been conserved over a long 
evolutionary period. Most interesting, the channel contains four internai re- 
peats, or homology units, having similar amino acid sequences, suggesting 
that gene duplication and divergence have produced the gene for this chan¬ 
nel. Hydrophobicity profiles indicate that each homology unit contains five 
hydrophobic segments (Sl, S2, S3, S5, and S6). Each repeat also contains a 
highly positively charged S4 segment; arginine or lysine residues are present 
at nearly every third residue. Numa proposed that segments Sl through S6 are 
membrane-spanning a helices (Figure 13.20). The positively charged residues 
in S4 segments act as the voltage sensors of the channel. The purification of 
calcium channels and the subsequent cloning and sequencing of their cDNAs 
revealed that these proteins are homologous to the sodium channels and have 
quite similar architectures; each protein comprises four imperfectly repeated 
units, each of which has regions corresponding to segments Sl through S6. 


FIGURE 13.20 The sodium channel. 

The Na^ channel is a single polypeptide 
chain with four repeating units (l-IV). 

Each repeat probably folds into six 
transmembrane helices. The loops (shown 
in red) between helices 5 and 6 of each 
domain form the pore of the channel. 














































We can thus note similarities between ligand-gated and voltage-gated 
channels. Like the acetylcholine receptor, the sodium channel is constructed 
of similar units. The acetylcholine receptor has five units, whereas the 
sodium channel has four units that have been fused into a single polypep- 
tide chain. The acetylcholine receptor is composed of similar but noncova- 
lently attached subunits. 


13.5.5 Potassium Channels Are Homologous to the 
Sodium Channel 

The purification of potassium channels proved to be much more 
* difficult because of their low abundance and the lack of known high- 
affinity ligands comparable to tetrodotoxin. The breakthrough carne in 
studies of mutant fruit flies that shake violently when anesthetized with 
ether. The mapping and cloning of the gene, termed shaker, responsible for 
this defect revealed the amino acid sequence encoded by a potassium- 
channel gene. The availability of this gene sequence has led to the cloning 
of potassium-channel cDNAs from many other organisms. Shaker cDNA 
encodes a 70-kd protein that has regions that correspond to one of the ho- 
mology units of the sodium channel containing the membrane-spanning 
segments Sl through S6. Thus, a potassium-channel subunit is homologous 
to one of the repeated homology units of the sodium and calcium channels. 
Consistent with this hypothesis, four potassium-channel subunits come to- 
gether to form a functional channel. Subsequently, other potassium chan¬ 
nels were discovered, including some from bactéria, which contain only the 
two membrane-spanning regions corresponding to segments S5 and S6. 
This and other information poínted to the region between S5 and Sô as a key 
component of the ion-channel pore in the potassium channel and in the sodium 
and calcium channels as welL The sequence relationships between these ion 
channels are summarized in Figure 13.21. 
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13.5.6 The Structure of a Potassium Channel Reveals the Basis 
of Rapid Ion Flow with Specificity 


STRUCTURAL INSICHTS, The Potassium Channel, examines the structural basis 
of the potassium channers ion specificity and high conductivity in further detail. 

Scientists were slowly discovering the likely structures of ion channels 
through a combination of patch-clamp methods, site-directed mutagenesis, 
and other methods. However, progress was limited by the lack of a high- 
resolution three-dimensional structure. The need was met by the determina- 
tion of the structure of a bacterial potassium channel by x-ray crystallography 
in 1998. The resulting structural framework is a source of insight into many 
aspects of ion-channel function, including specificity and rapidity of ion flow. 

As expected, the potassium channel is a tetramer of identical subunits, 
each of which includes two membrane-spanning a helices. The four subunits 
come together to form a pore in the shape of a cone that runs through the cen- 
ter of the structure (Figure 13.22). Beginning from the inside of the cell, the 
pore starts with a diameter of approximately 10 À and then constricts 
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FIGURE 13.21 Sequence relationships 
of ion channels. Like colors indicate 
structurally similar regions of the sodium, 
calcium, and potassium channels. These 
channels exhibit approximate fourfold 
symmetry. 


























View down the pore 



FIGURE 13.22 Structure of the potassium channel. The potassium channel, 
composed of four identical subunits, is cone shaped, with the larger opening facing the 
inside of the cell (center). A view down the pore, looking toward the outside of the cell, 
shows the relations of the individual subunits (left). One of the four identical subunits of 
the pore is illustrated at the right, with the pore-forming region highlighted in red. 


FIGURE 13.23 Path through a channel. 

A potassium ion entering the potassium 
channel can pass a distance of 22 Â into 
the membrane while remaining solvated 
with water (blue). At this point, the pore 
diameter narrows to 3 Â (yellow), and 
potassium must shed its water and 
interact with carbonyl groups (red) of the 
pore amino acids. 



34 Â 




FIGURE 13.24 Selectívíty filter of the potassium channel. Potassium ions interact with 
the carbonyl groups of the TVGYG sequence of the selectivity filter, located at the 3-Â-diameter 
pore of the potassium channel. 









to a smaller cavity with a diameter of 8 À. Both the opening to the outside 
and the central cavity of the pore are filled with water, and a ion can fit 
in the pore without losing its shell of bound water molecules. Approximately 
two-thirds of the way through the membrane, the pore becomes more con- 
stricted (3-A diameter). At that point, any ions must give up their wa¬ 
ter molecules and interact directly with groups from the protein. The chan- 
nel structure effectively reduces the thickness of the membrane from 34 À 
to 12 A by allowing the solvated ions to penetrate into the membrane be- 
fore the ions must directly interact with the channel (Figure 13.23). 

For potassium ions to relinquish their water molecules, other polar inter- 
actions must replace those with water. The restricted part of the pore is built 
from residues between the two transmembrane helices (which correspond to 
segments S5 and S6 in the sodium channel). In particular, a five-amino-acid 
stretch within this region functions as the selectivity filter that determines the 
preference for over other ions (Figure 13.24). The stretch has the sequence 
Thr-Val-Gly-Tyr-Gly, which is nearly completely conserved in all chan- 
nels and had already been identified as a signature sequence useful for iden- 
tifying potential channels. This region lies in a relatively extended con- 
formation and is oriented such that the peptide carbonyl groups are directed 
into the channel, facilitating interaction with the potassium ions. 

Potassium channels are 100-fold as permeable to as to Na^. How is 
this high degree of selectivity achieved? The narrow diameter (3 À) of the 
selectivity filter of the potassium channel enables the filter to reject ions 
having a radius larger than 1.5 A. However, a bare Na^ is small enough 
(Table 13.2) to pass through the pore. Indeed, the ionic radius of Na^ is 
substantially smaller than that of K^. How then is Na^ rejected? 

We need to consider the free-energy cost of dehydrating the Na^ and 
ions, given that they cannot pass through this part of the channel bearing a 
retinue of water molecules. The key point is that the free-energy costs of de¬ 
hydrating these ions are considerable [Na^, 72 kcal moP^ (301 kj moP^), 
and K^, 55 kcal moP^ (203 kJ moP^)]. The channel pays the cost of dehy¬ 
drating by providing compensating interactions with the carbonyl oxygen 
atoms lining the selectivity filter. However, these oxygen atoms are positioned 
such that they do not interact very favorably with Na^, because it is too small 
(Figure 13.25). The higher cost of dehydrating Na^ would be unrecovered, 
and so Na^ would be rejected. The ionic radii of oxygen, potassium, and 
sodium are 1.4, 1.33, and 0.95 À, respectively. Hence a ring of oxygen atoms 
positioned so that the K^-O distance is 2.73 Â (1.4 + 1.33 Â) would be op- 
timal for interaction with compared with the shorter Na^-O bonds (0.95 
+ 1.4 = 2.35 A) optimal for interaction with Na^.Thus, the potassium chan¬ 
nel avoids closely embracing Na^ ions, which must stay hydrated and hence 
are impermeant. 
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TABLE 13.2 Properties of alkaii 
cations 


Ion 

Ionic 

radius 

o 

(A) 

Hydration 
free energy 
in kcal mol~^ 
(kJ mol-') 

Li+ 

0.60 

-98 (-410) 

Na-" 

0.95 

-72 (-301) 

K+ 

1.33 

-55 (-230) 

Rb-" 

1.48 

-51 (-213) 

Cs-" 

1.69 

-47 (-197) 


FIGURE 13.25 Energetic basís of ion 
selectivity. The energy cost of dehydrating 
a potassium ion is compensated by 
favorable interactions with the selectivity 
filter. Because sodium is too small to 
interact favorably with the selectivity filter, 
the free energy of desolvation cannot be 
compensated and the sodium does not 
pass through the channel. 
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In addition to selectivity, ion channels display rapid rates of ion transport. 
A structural analysis provides an appealing explanation for this proficiency. 
The results of such studies revealed the presence of two potassium-binding 
sites in the constricted regions of the potassium channel that are crucial for 
rapid ion flow. Consider the process of ion conductance. One ion pro- 
ceeds into the channel and through the relatively unrestricted part of the 
channel. It then gives up most or all of its coordinated water molecules and 
binds to the first site in the selectivity filter region, a favorable binding site. 
It can then jump to the second site, which appears to have comparable bind¬ 
ing energy. However, the binding energy of the second site presents a free- 
energy barrier, or trap, preventing the ion from completing its journey; there 
is no energetic reason to leave the second ion-binding site. However, if a 
second ion moves through the channel into the first site, the electrostatic 
repulsion between the two ions will destabilize the initially bound ion and 
help push it into solution (Figure 13.26). This mechanism provides a solu- 
tion to the apparent paradox of high ion selectivity (requiring tight binding 
sites) and rapid flow. 



FIGURE 13.26 Two-site model for the 
potassium channel. The restricted part 
of the potassium channel has two 
energetically similar binding sites. The 
binding of a second potassium ion creates 
electrostatic repulsion to push the first ion 
out of the channel. 



Y The structure determined for channels is a good start for 
considering the amino acid sequence similarities, as well as the struc¬ 
tural and functional relations, for Na^ and Ca^^ channels because of their 
homology to channels. Sequence comparisons and the results of muta- 
genesis experiments have also implicated the region between segments S5 
and S6 in ion selectivity in the Ca^^ channels. In Ca^^ channels, one glu- 
tamate residue of this region in each of the four units plays a major role in 
determining ion selectivity. The Na^ channels selection of Na^ o ver 
depends on ionic radius; the diameter of the pore is sufficiently restricted 
that small ions such as Na^ and Li^ can pass through the channel, but larger 
ions such as are significantly hindered (Figure 13.27). 


FIGURE 13.27 Selectivity of the sodium 
channel. The ionic selectivity of the 
sodium channel partiy depends on steric 
factors. Sodium and lithium ions, together 
with a water molecule, fit in the channel, 
as do hydroxylamine and hydrazine. In 
contrast, with a water molecule is too 
large. [After R. D. Keynes. Ion channels in the 
nerve-cell membrane. Copyright © 1979 by 
Scientific American, Inc. All rights reserved.] 
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Residues in the positions corresponding to the glutamate residues in 
Ca^^ channels are major components of the selectivity filter of the Na^ 
channel. These residues are aspartate, glutamate, lysine, and alanine in units 
1 , 2, 3, and 4, respectively (the DEKA locus). Thus, the potential fourfold 
symmetry of the channel is clearly broken in this region, providing one ex- 
planation of why Na^ channels comprise single large polypeptide chains 
rather than a noncovalent assembly of four identical subunits. 


13.5.8 A Channel Can Be Inactivated by Occiusion of the Pore: 
The Ball-and-Chain Model 

The potassium channel and the sodium channel undergo inactivation within 
milliseconds of channel opening (Figure 13.28). A first clue to the mecha- 
nism of inactivation carne from exposing the cytoplasmic side of either chan¬ 
nel to trypsin; cleavage by trypsin produced a trimmed channel that stayed 
persistently open after depolarization. A second clue was the finding that 
alternatively spliced variants of the potassium channel have markedly dif- 
ferent inactivation kinetics; these variants differed from one another only 
near the amino terminus, which is on the cytoplasmic side of the channel. 
A mutant Shaker channel lacking 42 amino acids near the amino terminus 
opened in response to depolarization but did not inactivate (see Figure 
13.28). Most revealing, inactivation was restored by adding a synthetic pep- 
tide corresponding to the first 20 residues of the native channel. 

These experiments strongly support the bali- and- chain model for chan¬ 
nel inactivation that had been proposed years earlier (Figure 13.29). Accord- 
ing to this model, the first 20 residues of the potassium channel form a cyto¬ 
plasmic unit (the bali) that is attached to a flexible segment of the polypeptide 
(the chain). When the channel is closed, the bali rotates freely in the aque- 
ous solution. When the channel opens, the bali quickly finds a complemen- 
tary site in the pore and occludes it. Hence, the channel opens for only a brief 
interval before it undergoes inactivation by occiusion. Shortening the chain 
speeds inactivation because the bali finds its target more quickly. Conversely, 
lengthening the chain slows inactivation. Thus, the duration of the open state 
can be controlled by the length and flexibility of the tether. 




FIGURE 13.28 Inactivation of the 
potassium channel. The amino-terminal 
region of the potassium chain is criticai 
for inactivation. (A) The wild-type Shaker 
potassium channel displays rapid 
inactivation after opening. (B) A mutant 
channel lacking residues 6 through 46 
does not inactivate. (C) Inactivation can be 
restored by adding a peptide consisting of 
residues 1 through 20 at a concentration 
of 100 pM. [After W. N. Zagotta, T. Hoshi, and 
R. W. Aldrich. Science 250(1990):568.] 


FIGURE 13.29 Ball-and-chain model 
for channel inactivation. The inactivation 
domain, or "bali" (red), is tethered to the 
channel by a flexible "chain" (green). In 
the closed State, the bali is located in the 
cytosol. Depolarization opens the channel 
and creates a negatively charged binding 
site for the positively charged bali near the 
mouth of the pore. Movement of the bali 
into this site inactivates the channel by 
occiuding it. [After C. M. Arnnstrong and 
F. Bezanilla. 7. Gen. Physiol. 70(1977);567] 


13.6 GAP JUNCTIONS ALLOW lONS AND SMALL 
MOLECULES TO FLOW BETWEEN COIVIIVIUNICATING 
CELLS 

The ion channels that we have considered thus far have narrow pores and 
are moderately to highly selective in regard to which ions are permeant. 
They are closed in the resting state and have short lifetimes in the open state, 
typically a millisecond, that enable them to transmit highly frequent neu- 
ral signals. We turn now to a channel with a very different role. Gap junc- 
tions, also known as cell-to-cell channels, serve as passageways between the 




























FIGURE 13.30 Gap junctions. (A) Th is 

electron micrograph shows a sheet of 
isolated gap junctions. The cylindrical 
connexons form a hexagonal lattice having 
a unit-cell length of 85 Â. The densely 
stained central hole has a diameter of 
about 20 Â. (B) Electron micrograph of a 
tangential view of apposed cell membranes 
that are joined by gap junctions. 

[(A) Courtesy of Dr. Nigel Unwin and 

Dr. Cuido Zampighi; (B) from E. L. Hertzberg 

and N. B. Gilula. 7. Biol. Chem. 254(1979):2143.] 


Plasma Extracellular 



FIGURE 13.31 Schematic representation 
of a gap junctíon. [Courtesy of Dr. Werner 
Loewenstein.] 


interiors of contiguous cells. Gap junctions are clustered in discrete regions 
of the plasma membranes of apposed cells. Electron micrographs of sheets 
of gap junctions show them tightly packed in a regular hexagonal array 
(Figure 13.30A). A 20-A central hole, the lumen of the channel, is promi- 
nent in each gap junction. A tangential view (Figure 13.30B) shows that 
these channels span the intervening space, or gap, between apposed cells 
(hence, the name gap junction). The width of the gap between the cytosols 
of the two cells is about 35 À. 

Small hydrophilic molecules as well as ions can pass through gap junc¬ 
tions. The pore size of the junctions was determined by microinjecting a 
series of fluorescent molecules into cells and observing their passage into 
adjoining cells. All polar molecules with a mass of less than about 1 kd can 
readily pass through these cell-to-cell channels. Thus, inorganic ions and 
most metabolites (e.g., sugars, amino acids, and nucleotides) canflow between 
the interiors of cells joined by gap junctions. In contrast, proteins, nucleic 
acids, and polysaccharides are too large to traverse these channels. Gap junc¬ 
tions are important for intercellular communication. Cells in some excitable 
tissues, such as heart muscle, are coupled by the rapid flow of ions through 
these junctions, which ensure a rapid and synchronous response to stimuli. 
Gap junctions are also essential for the nourishment of cells that are distant 
from blood vessels, as in lens and bone. Moreover, communicating chan¬ 
nels are important in development and differentiation. For example, a 
pregnant uterus is transformed from a quiescent protector of the fetus to a 
forceful ejector at the onset of labor; the formation of functional gap junc¬ 
tions at that time creates a syncytium of muscle cells that contract in 
synchrony. 

A cell-to-cell channel is made of 12 molecules of connexin, one of a fam- 
ily of transmembrane proteins with molecular masses ranging from 30 to 
42 kd. Each connexin molecule appears to have four membrane-spanning 
helices. Six connexin molecules are hexagonally arrayed to form a half chan¬ 
nel, called a connexon or hemichannel (Figure 13.31). Two connexons join end 
to end in the intercellular space to form a functional channel between the 
communicating cells. Cell-to-cell channels differ from other membrane 
channels in three respects: (1) they traverse two membranes rather than one; 
(2) they connect cytosol to cytosol, rather than to the extracellular space or 
the lumen of an organelle; and (3) the connexons forming a channel are syn- 
thesized by different cells. Gap junctions form readily when cells are brought 
together. A cell-to-cell channel, once formed, tends to stay open for seconds 
to minutes. They are closed by high concentrations of calcium ion and by 
low pH. The closing of gap junctions by Caf^ and serves to seal normal 
cells from traumatized or dying neighbors. Gap junctions are also controlled 
by membrane potential and by hormone-induced phosphorylation. 



















SUMMARY 


Summary 

The Transport of Molecules Across a Membrane May Be Active or Passive 

For a net movement of molecules across a membrane, two features are 
required: (1) the molecule must be able to cross a hydrophobic barrier 
and (2) an energy source must power the movement. Lipophilic mole¬ 
cules can pass through a membrane s hydrophobic interior by simple dif- 
fusion. These molecules will move down their concentration gradients. 

Polar or charged molecules require proteins to form passages through 
the hydrophobic barrier. Passive transport or facilitated diffusion occurs 
when an ion or polar molecule moves down its concentration gradient. 

If a molecule moves against a concentration gradient, an externai energy 
source is required; this movement is referred to as active transport and 
results in the generation of concentration gradients. Concentration gra¬ 
dients are a commonly used form of energy in all organisms. 

A Family of Membrane Proteins Uses ATP Hydrolysis to Pump lons 
Across Membranes 

Active transport is often carried out at the expense of ATP hydrolysis. 

P-type ATPases pump ions against a concentration gradient and be- 
come transiently phosphorylated on an aspartic acid residue in the 
process of transport. P-type ATPases, which include the sarcoplasmic 
reticulum Ca^^ ATPase and the Na^-K^ ATPase, are integral mem¬ 
brane proteins with conserved structures and catalytic mechanisms. 

Multidrug Resistance and Cystic Fibrosis Highlight a Family of Membrane 
Proteins with ATP-Binding Cassette Domains 

The membrane proteins with ATP-binding cassette (ABC) domains are 
complex ATP-dependent pumps. Each pump includes four major domains: 
two domains span the membrane and two others contain ABC P-loop 
ATPase structures. The multidrug resistance proteins confer resistance on 
câncer cells by pumping chemotherapeutic drugs out of a câncer cell be- 
fore the drugs can exert their effects. Another ABC domain protein is the 
cystic fibrosis transmembrane conductance regulator (CFTR), an ATP- 
gated chloride channel. Mutations in CFTR can result in cystic fibrosis. 

Secondary Transporters Use One Concentration Gradient to Power the 
Formation of Another 

Many active-transport systems couple the uphill flow of one ion or mole¬ 
cule to the downhill flow of another. These membrane proteins, called 
secondary transporters or cotransporters, can be classified as antiporters 
or symporters. Antiporters couple the downhill flow of one type of ion 
in one direction to the uphill flow of another in the opposite direction. 

Symporters move both ions in the same direction. 

o Specific Channels Can Rapidiy Transport lons Across Membranes 

Ion channels allow the rapid movement of ions across the hydrophobic 
barrier of the membrane. Such channels allow ions to flow down their 
concentration gradients. The channels have several features in common: 

(1) ion specificity, (2) the existence of open and closed States, (3) regu- 
lation by ligands or voltage. Ion channels are exemplified by the Na^ 
and channels responsible for nerve impulses. 

Gap Junctions Allow lons and Small Molecules to Flow 
Between Communicating Cells 

In contrast with many channels, which connect the cell interior with the 
environment, gap junctions, or cell-to-cell channels, serve to connect 
the interiors of contiguous groups of cells. A cell-to-cell channel is com- 
posed of 12 molecules of connexin, which associate to form two 6-mem- 
bered connexons. 
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-1 KEY TERMS |- 


facilitated diffusion (passive transport) 
(p. 346) 

active transport (p. 346) 

Na+-K+ pump (Na+-K+-ATPase) 

(p. 347) 

P-type ATPase (p. 348) 
digitalis (p. 350) 
multidrug resistance (p. 351) 


ATP-binding cassette (ABC) domain 
(p.351) 

secondary transporter (cotransporter) 
(p. 352) 

antiporter (p. 352) 
symporter (p. 352) 
ion channel (p. 353) 
ligand-gated channel (p. 353) 


voltage-gated channel (p. 353) 
patch-clamp technique (p. 354) 
gigaseal (p. 354) 
neurotransmitter (p. 355) 
action potential (p. 357) 
selectivity filter (p. 361) 
gap junction (p. 363) 
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-1 PROBLEMS I- 

1. Concerted opening. Suppose that a channel obeys the con¬ 
certed allosteric model (MWC model, Section 10.3.4). The 
binding of ligand to the R state (the open form) is 20 times as 
tight as to the T state (the closed form). In the absence of lig¬ 
and, the ratio of closed to open channels is 10^. If the channel is 
a tetramer, what is the fraction of open channels when 1, 2, 3, 
and 4 ligands are bound? 

2. Respiratory paralysis. Tabun and sarin have been used as 
Chemical-warfare agents, and parathion has been employed as 
an insecticide. What is the molecular basis of their lethal actions? 



3. Ligand-induced channel opening. The ratio of open to closed 
forms of the acetylcholine receptor channel containing zero, one, 
and two bound acetylcholine molecules is 5 X 10~^, 1.2 X 10~^, 
and 14, respectively. 

(a) By what factor is the open/closed ratio increased by the 
binding of the first acetylcholine molecule? The second acetyl¬ 
choline molecule? 

(b) What are the corresponding free-energy contributions to 
channel opening at 25°C? 

(c) Can the allosteric transition be accounted for by the MWC 
concerted model? 

4. Voltage-induced channel opening. The fraction of open chan¬ 
nels at 5 mV increments beginning at — 45 mV and ending at 
+ 5 mV at 20°C is 0.02, 0.04, 0.09, 0.19, 0.37, 0.59, 0.78, 0.89, 
0.95, 0.98, and 0.99. 

(a) At what voltage are half the channels open? 

(b) What is the value of the gating charge? 

(c) How much free energy is contributed by the movement of 
the gating charge in the transition from —45 mV to +5 mV? 

5. Different directions. The potassium channel and the sodium 
channel have similar structures and are arranged in the same ori- 
entation in the cell membrane. Yet, the sodium channel alio ws 
sodium ions to flow into the cell and the potassium channel al- 
lows potassium ions to flow out of the cell. Explain. 


Parathion 
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6 . Structure-activity relations. On the basis of the structure of 
tetrodotoxin, propose a mechanism by which the toxin inhibits 
sodium flow through the sodium channel. 

7. A dangerous snail. Cone snails are carnivores that inject a 
powerful set of toxins into their prey, leading to rapid paralysis. 
Many of these toxins are found to bind to specific ion-channel 
proteins. Why are such molecules so toxic? How might such tox¬ 
ins be useful for biochemical studies? 

8 . Only afeu). Why do only a small number of sodium ions need 
to flow through the sodium channel to significantly change the 
membrane potential? 

9. Frog poison. Batrachotoxin (BTX) is a steroidal alkaloid from 
the skin of Phyllobates terribilis, a poisonous Colombian frog 
(source of the poison used on blowgun darts). In the presence of 
BTX, sodium channels in an excised patch stay persistently open 
when the membrane is depolarized. They close when the mem¬ 
brane is repolarized. Which transition is blocked by BTX? 



Phyllobates terribilis. [Tom McHugh, Photo Researchers.] 


Chapter Integration Problem 

14. Speed and efficiency matter. Acetylcholine is rapidly de- 
stroyed by the enzyme acetylcholinesterase. This enzyme, which 
has a turno ver number of 25,000 per second, has attained cat- 
alytic perfection with a fecat/^M of2 X 10^M~^s~\ Why is it 
physiologically crucial that this enzyme be so efficient? 

Mechanism Problem 

15. Remembrance of mechanisms past. Acetylcholinesterase con- 
verts acetylcholine into acetate and choline. Show the reaction as 
Chemical structures : 

Acetylcholine + H 2 O-> acetate + choline 

Like serine proteases, acetylcholinesterase is inhibited by DIPF. 
Propose a catalytic mechanism for acetylcholine digestion by 
acetylcholinesterase. 

Data Interpretation Problem 

16. Tarantula toxin. Acid sensing is associated with pain, tast- 
ing, and other biological activities (Chapter 32). Acid sensing is 
carried out by a ligand-gated channel that permits sodium in- 
flux in response to H^. This family of acid-sensitive ion chan¬ 
nels (ASICs) comprises a number of members. Psalmotoxin 1 
(PcTXl), a venom from the tarantula, inhibits some members 
of this family. Below are electrophysiological recordings of cells 
containing one of several members of the ASIC family made in 
the presence of the toxin at a concentration of 10 nM. The chan¬ 
nels were opened by changing the pH from 7.4 to the indicated 
values. The PcTXl was present for a short time (indicated by 
the black bar above the recordings) after which time it was rap¬ 
idly washed from the system. 


10 . Valium target. 7 -Aminobutyric acid (GABA) opens chan¬ 
nels that are specific for chloride ions. The GABAa receptor 
channel is pharmacologically important because it is the target 
of Valium, which is used to diminish anxiety. 

(a) The extracellular concentration of CP is 123 mM and the 
intracellular concentration is 4 mM. In which direction does CP 
flow through an open channel when the membrane potential is 
in the —60 mV to +30 mV range? 

(b) What is the effect of chloride-channel opening on the ex- 
citability of a neuron? 

(c) The hydropathy profile of the GABAa receptor resembles 
that of the acetylcholine receptor. Predict the number of sub- 
units in this chloride channel. 

11 . The price of extrusion. What is the free-energy cost of pump- 
ing Ca^^ out of a cell when the cytosolic concentration is 
0.4 fxM, the extracellular concentration is 1.5 mM, and the 
membrane potential is —60 mV? 

12. Rapid transit. A channel exhibits current increments of 5 pA 
at a membrane potential of — 50 mV. The mean open time is 
1 ms. 

(a) What is the conductance of this channel? 

(b) How many univalent ions flow through the channel during 
its mean open time? 

(c) How long does it take an ion to pass through the channel? 

13. Pumping protons. Design an experiment to show that lactose 
permease can be reversed in vitro to pump protons. 



ASIC3 


pH4 - 



100 s 



(A) Electrophysiological recordings of cells exposed to tarantula 
toxin. (B) Plot of peak current of a cell containing the ASICla 
protein versus the toxin concentration. [From P. Escoubas, et al., 2000, 
J. Biol. Chem. 275:25116-215121.] 


(a) Which of the ASIC family members^ASICla, ASIClb, 
ASIC 2 a, or ASIC3—is most sensitive to the toxin? 

(b) Is the effect of the toxin reversible? Explain. 

(c) What concentration of PcTXl yields 50% inhibition of the 
sensitive channel? 








































17. Channel problems 1. A number of pathological conditions re- 
sult from mutations in the acetylcholine receptor channel. One 
such mutation in the (3 subunit, |3V266M, causes muscle weak- 
ness and rapid fatigue. An investigation of the acetylcholine- 
generated currents through the acetylcholine receptor channel 
for both a control and a patient yielded the foliowing results. 
What is the effect of the mutation on channel function? Suggest 
some possible biochemical explanations for the effect. 


Control 


nn/ir^nnjiT" 


Closed channel 
Open channel 


Patient 

Closed channel 
Open channel 



18. Channel problems 2. The acetylcholine receptor channel can 
also undergo mutation leading to fast channel syndrome (FCS), 
with clinicai manifestations similar to those of slow channel syn¬ 
drome (SCS). What would the recordings of ion movement look 
like in this syndrome? Again, suggest a biochemical explanation. 

19. Transport differences. The rate of transport of two molecules, 
Índole and glucose, across a cell membrane is shown in the right 
column. What are the differences between the transport mech- 
anisms of the two molecules? Suppose that ouabain inhibited the 
transport of glucose. What would this inhibition suggest about 
the mechanism of transport? 
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Media Problem 


20 . The merits of inflexibility. The selectivity filter of potassium 
channels restricts passage of sodium ions even though sodium 
ions are smaller than potassium ions. Mutation of the tyrosine 
that is part of this filter to leucine has been shown to reduce se¬ 
lectivity against sodium in a channel homologous to the one 
whose structure has been determined [Chapman, M. L., 
Krovetz, H. S., and VanDongen, A. M. J., 2001. GYGD pore 
motifs in neighboring potassium channel subunits interact to de¬ 
termine ion selectivity. /. Physiol 530:21-33]. Look at the three- 
dimensional structure in the Potassium Channel Structural In- 
sights module and propose, in general terms, an explanation for 
the role of this tyrosine in the wild-type protein and the effect 
of its mutation to leucine. 





Metabolism: Basic Concepts and Design 




Hummingbirds are capable of prodigíous feats of 
endurance. For instance, the tiny ruby-throated hummingbird 
can store enough fuel to fly across the Gulf of México, a 
distance of some 500 miles, without resting. This achievement 
is possible because of the ability to convert fuels into the 
cellular energy currency, ATP, represented by the model at 
the right. [(Left) K. D. McGraw/Rainbow.] 



The concepts of conformation and dynamics developed in Part I—espe- 
cially those dealing with the specificity and catalytic power of enzymes, the 
regulation of their catalytic activity, and the transport of molecules and 
ions across membranes—enable us to now ask questions fundamental 
to biochemistry: 

1 . How does a cell extract energy and reducing power from its environment? 

2. How does a cell synthesize the building blocks of its macromolecules and 
then the macromolecules themselves? 

These processes are carried out by a highly integrated net- 
work of Chemical reactions that are collectively known as 
metabolism. 

More than a thousand chemical reactions take place in 
even as simple an organism as Escherichia coli. The array of 
reactions may seem overwhelming at first glance. However, 
closer scrutiny reveals that metabolism has a coherent design 
containing many common motifs. These motifs include the 
use of an energy currency and the repeated appearance of a 
limited number of activated intermediates. In fact, a group 
of about 100 molecules play central roles in all forms of life. 

Furthermore, although the number of reactions in metab¬ 
olism is large, the number of kinds of reactions is small and 
the mechanisms of these reactions are usually quite simple. Metabolic path- 
ways are also regulated in common ways. The purpose of this chapter is to 
introduce some general principies and motifs of metabolism to provide a 
foundation for the more detailed studies to follow. 


OUTLINE 


O 14.1 Metabolism Is Composed of Many 
Coupled, Interconnecting Reactions 

o 14.2 The Oxidation of Carbon Fuels Is 
an Important Source of Cellular Energy 

o 14.3 Metabolic Pathways Contain Many 
Recurring Motifs 
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FIGURE 14.1 Glucose metabolism. 

Glucose is metabolized to pyruvate in 
10 linked reactions. Under anaerobic 
conditions, pyruvate is metabolized to 
lactate and, under aerobic conditions, to 
acetyl CoA. The glucose-derived carbons 
are subsequentiy oxidized to CO 2 . 


14.0.1 Cells Transform Different Types of Energy 

Living organisms require a continuai input of free energy for three major 
purposes: (1) the performance of mechanical work in muscle contraction and 
other cellular movements, (2) the active transport of molecules and ions, and 
(3) the synthesis of macromolecules and other biomolecules from simple pre- 
cursors. The free energy used in these processes, which maintain an organ- 
ism in a state that is far from equilibrium, is derived from the environment. 

The First Law of Thermodynamics States that energy can be neither cre- 
ated nor destroyed. The amount of energy in the universe is constant. Nev- 
ertheless, energy can be converted from one form into another. 

Photosynthetic organisms, or phototrophs, use the energy of sunlight to 
convert simple energy-poor molecules into more-complex energy -rich mol¬ 
ecules that serve as fuels. In other words, photosynthetic organisms trans¬ 
form light energy into chemical energy. Indeed, this transformation is ulti- 
mately the primary source of chemical energy for the vast majority of 
organisms, human beings included. Chemotrophs, which include animais, 
obtain chemical energy through the oxidation of foodstuffs generated by 
phototrophs. 

Chemical energy obtained from the oxidation of carbon compounds may 
be transformed into the unequal distribution of ions across a membrane, re- 
sulting in an ion gradient. This gradient, in turn, is an energy source that 
can be used to move molecules across membranes, that can be converted 
into yet other types of chemical energy, or that can convey Information in 
the form of nerve impulses. In addition, chemical energy can be transduced 
into mechanical energy. We convert the chemical energy of a fuel into struc- 
tural alterations of contractile proteins that result in muscle contraction and 
movement. Finally, chemical energy powers the reactions that result in the 
synthesis of biomolecules. 

At any given instant in a cell, thousands of energy transformations are 
taking place. Energy is being extracted from fuels and used to power biosyn- 
thetic processes. These transformations are referred to as metabolism or 
intermediary metabolism. 


14.1 METABOLISIVI IS COMPOSED OF MANY COUPLED, 
INTERCONNECTING REACTIONS 

Metabolism is essentially a linked series of chemical reactions that begins 
with a particular molecule and converts it into some other molecule or mol¬ 
ecules in a carefully defined fashion (Figure 14.1). There are many such de- 
fined pathways in the cell (Figure 14.2), and we will examine a few of them 
in some detail later. These pathways are interdependent, and their activity 
is coordinated by exquisitely sensitive means of communication in which 
allosteric enzymes are predominant (Section 10.1). We will consider the 
principies of this communication in Chapter 15. 

We can divide metabolic pathways into two broad classes: (1) those that 
convert energy into biologically useful forms and (2) those that require in- 
puts of energy to proceed. Although this division is often imprecise, it is 
nonetheless a useful distinction in an examination of metabolism. Those re¬ 
actions that transform fuels into cellular energy are called catabolic reactions 
or, more generally, catabolism. 

Fuels (carbohydrates, fats) > CO 2 + H 2 O + useful energy 

Those reactions that require energy—such as the synthesis of glucose, fats, 
or DNA—are called anabolic reactions or anabolism. The useful forms of 
energy that are produced in catabolism are employed in anabolism to gen- 
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Coupled Reactions 


FIGURE 14.2 Metabolic pathways. 

[From the Kyoto Encyclopedia of Genes and 
Genomes (www.genome.ad.jp/kegg).] 


erate complex structures from simple ones, or energy-rich States from 
energy-poor ones. 

Useful energy + small molecules complex molecules 


Some pathways can be either anabolic or catabolic, depending on the en¬ 
ergy conditions in the cell. They are referred to as amphibolic pathways. 


14.1.1 A Thermodynamically Unfavorable Reaction Can Be 
Driven by a Favorable Reaction 


CONCEPTUAL INSICHTS, Energetic Coupling, offers a graphical presentation of 
how enzymatic coupling enables a favorable reaction to drive an unfavorable reaction. 


How are specific pathways constructed from individual reactions? A path- 
way must satisfy minimally two criteria: (1) the individual reactions must 
be specific and (2) the entire set of reactions that constitute the pathway must 
be thermodynamically favored. A reaction that is specific will yield only one 
particular product or set of products from its reactants. As discussed in 
Chapter 8, a function of enzymes is to provide this specificity. The ther- 
modynamics of metabolism is most readily approached in terms of free en¬ 
ergy, which was discussed in Sections 1.3.3, 8.2.1, and 8.2.2. A reaction can 
occur spontaneously only if AG, the change in free energy, is negative. Re- 
call that AG for the formation of products C and D from substrates A and 
B is given by 

Thus, the AG of a reaction depends on the nature of the reactant products 
(expressed by the AG°' term, the standard free-energy change) and on their 
concentrations (expressed by the second term). 
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and Design energy changes of the individual steps. Consider the following reactions: 

A V + C AG°' = +5 kcalmoP^ (+21 kJmoP^) 

B V " D AG°' = — 8 kcal moG^ ( —34 kj moG^) 

A^^C + D AG°' = -3kcalmoG^(-13kJmoG^) 

Under standard conditions, A cannot be spontaneously converted into 
B and C, because AG is positive. However, the conversion of B into D un¬ 
der standard conditions is thermodynamically feasible. Because free- 
energy changes are additive, the conversion of A into C and D has a AG°' of 
— 3 kcal moG^ ( — 13 kj moG^), which means that it can occur sponta¬ 
neously under standard conditions. Thus, a thermodynamically unfavorable 
reaction can be driven by a thermodynamically favorable reaction to which it 
is coupled. In this example, the chemical intermediate B, common to both 
reactions, couples the reactions. Thus, metabolic pathways are formed by 
the coupling of enzyme-catalyzed reactions such that the overall free en¬ 
ergy of the pathway is negative. 

14.1.2 ATP Is the Universal Currency of Free Energy 
in Biological Systems 

Just as commerce is facilitated by the use of a common currency, the com- 
merce of the cell—metabolism—is facilitated by the use of a common en¬ 
ergy currency, adenosine triphosphate (ATP). Part of the free energy derived 
from the oxidation of foodstuffs and from light is transformed into this 
highly accessible molecule, which acts as the free-energy donor in most 
energy-requiring processes such as motion, active transport, or biosynthesis. 

ATP is a nucleotide consisting of an adenine, a ribose, and a triphos¬ 
phate unit (Figure 14.3). The active form of ATP is usually a complex of 
ATP with Mg^^ or Mn^^ (Section 9.4.2). In considering the role of ATP 
as an energy carrier, we can focus on its triphosphate moiety. ATP is an 
energy-rich molecule because its triphosphate unit contains two phosphoanhydride 
bonds. A large amount of free energy is liberated when ATP is hydrolyzed to 
adenosine diphosphate (ADP) and orthophosphate (Pi) or when ATP is hy¬ 
drolyzed to adenosine monophosphate (AMP) and pyrophosphate (PPi). 

ATP + H20^^ADP + Pi 

AG°' = —7.3 kcal moG^ ( — 30.5 kJ moG^) 

ATP + H20^^AMP + PPi 

AG°' = —10.9 kcal moG^ ( — 45.6 kJ moG^) 

The precise AG°' for these reactions depends on the ionic strength of the 
médium and on the concentrations of Mg^^ and other metal ions. Under 
typical cellular concentrations, the actual AG for these hydrolyses is 
approximately —12 kcal moG^ ( —50 kJ moG^). 

The free energy liberated in the hydrolysis of ATP is harnessed to drive 
reactions that require an input of free energy, such as muscle contraction. 
In turn, ATP is formed from ADP and Pi when fuel molecules are oxidized 
in chemotrophs or when light is trapped by phototrophs. This ATP-ADP 
cycle is the fundamental mode of energy exchange in biological systems. 

Some biosynthetic reactions are driven by hydrolysis of nucleoside 
triphosphates that are analogous to ATP—namely, guanosine triphosphate 
(GTP), uridine triphosphate (UTP), and cytidine triphosphate (CTP). The 
diphosphate forms of these nucleotides are denoted by GDP, UDP, and 
CDP, and the monophosphate forms by GMP, UMP, and CMP. Enzymes 
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Adenosine triphosphate (ATP) 


Adenosine diphosphate (ADP) 



Adenosine monophosphate (AMP) 


FIGURE 14.3 Structures of ATP, ADP, and AMP. These 
adenylates consist of adenine (blue), a ribose (black), and a tri-, 
di-, or monophosphate unit (red). The innermost phosphorus 
atom of ATP is designated P^, the middie one Pp, and the 
outermost one P^. 


can catalyze the transfer of the terminal phosphoryl group from one nu- 
cleotide to another. The phosphorylation of nucleoside monophosphates is 
catalyzed by a family of nucleoside monophosphate kinases, as discussed in 
Section 9.4.1. The phosphorylation of nucleoside diphosphates is catalyzed 
by nucleoside diphosphate kinase, an enzyme with broad specificity. It is 
intriguing to note that, although all of the nucleotide triphosphates are en- 
ergetically equivalent, ATP is nonetheless the primary cellular energy car- 
rier. In addition, two important electron carriers, NAD^ and FAD, are de- 
rivatives of ATP. The role of ATP in energy metabolism is paramount. 

14.1.3 ATP Hydrolysis Drives Metabolism by Shifting the 
Equilibrium of Coupled Reactions 

How does coupling to ATP hydrolysis make possible an otherwise unfavor- 
able reaction? Consider a chemical reaction that is thermodynamically un- 
favorable without an input of free energy, a situation common to many 
biosynthetic reactions. Suppose that the standard free energy of the conver- 
sion of compound A into compound B is +4.0 kcal moP^ ( + 13 kj moP^): 

A ^ " B AG°' = +4.0 kcal moP^ (+13 kJ moP^) 

The equilibrium constant fCgq of this reaction at 25°C is related to AG°' (in 
units of kilocalories per mole) by 

= [B]eq/[A]eq = = 1.15 X 10-' 

Thus, net conversion of A into B cannot occur when the molar ratio of B to 
A is equal to or greater than 1.15 X 10~^. However, A can be converted 
into B under these conditions if the reaction is coupled to the hydrolysis of 
ATP. The new overall reaction is 

A + ATP + H20^^B + ADP + Pi + H+ 

AG°' = —3.3 kcal moD^ ( — 13.8 kJ moD^) 

Its standard free-energy change of —3.3 kcal moP^ ( — 13.8 kJ moP^) is 
the sum of the value of AG°' for the conversion of A into B [ + 4.0 kcal 
moP^ ( + 12.6 kJ moP^)] and the value of AG°' for the hydrolysis of ATP 
[ — 7.3 kcal moP^ ( — 30.5 kJ moP^)]. At pH 7, the equilibrium constant 
of this coupled reaction is 








= io3'3/i-36= 2.67 X 10^ 
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[A],, [ATP],, 

At equilibrium, the ratio of [B] to [A] is given by 



[A]e, 


The ATP-generating system of cells maintains the [ATP]/[ADP][Pi] ratio 
at a high levei, typically of the order of 500 M~b For this ratio, 

TRl 

-r- = 2.67 X 10^ X 500 = 1.34 X 10^ 

[A]eq 

which means that the hydrolysis of ATP enables A to be converted into B 
until the [B]/[A] ratio reaches a value of 1.34 X 10^. This equilibrium ra¬ 
tio is strikingly different from the value of 1.15 X 10~^ for the reaction 
A —> B in the absence of ATP hydrolysis. In other words, coupling the 
hydrolysis of ATP with the conversion of A into B has changed the equi¬ 
librium ratio of B to A by a factor of about 10^. 

We see here the thermodynamic essence of ATP’s action as an energy- 
coupling agent. Cells maintain a high levei of ATP by using oxidizable sub- 
strates or light as sources of free energy. The hydrolysis of an ATP mole- 
cule in a coupled reaction then changes the equilibrium ratio of products to 
reactants by a very large factor, of the order of 10^. More generally, the hy¬ 
drolysis of n ATP molecules changes the equilibrium ratio of a coupled re¬ 
action (or sequence of reactions) by a factor of lO^’^. For example, the hy¬ 
drolysis of three ATP molecules in a coupled reaction changes the 
equilibrium ratio by a factor of 10^"^. Thus, a thermodynamically unfavorable 
reaction sequence can be converted into a favorable one by coupling it to the 
hydrolysis of a sufficient number of ATP molecules in a new reaction. It should 
also be emphasized that A and B in the preceding coupled reaction may be 
interpreted very generally, not only as different chemical species. For ex¬ 
ample, A and B may represent activated and unactivated conformations of 
a protein; in this case, phosphorylation with ATP may be a means of con¬ 
version into an activated conformation. Such a conformation can store free 
energy, which can then be used to drive a thermodynamically unfavorable 
reaction. Through such changes in conformation, molecular motors such as 
myosin, kinesin, and dynein convert the chemical energy of ATP into me- 
chanical energy (Chapter 34). Indeed, this conversion is the basis of mus- 
cle contraction. 

Alternatively, A and B may refer to the concentrations of an ion or mol- 
ecule on the outside and inside of a cell, as in the active transport of a nu- 
trient. The active transport of Na^ and across membranes is driven by 
the phosphorylation of the sodium-potassium pump by ATP and its sub- 
sequent dephosphorylation (Section 13.2.1). 

14.1.4 Structural Basis of the High Phosphoryl Transfer 
Potential of ATP 

As illustrated by molecular motors (Chapter 34) and ion pumps (Section 
13.2), phosphoryl transfer is a common means of energy coupling. Fur- 
thermore, as we shall see in Chapter 15, phosphoryl transfer is also widely 
used in the intracellular transmission of information. What makes ATP a 
particularly efficient phosphoryl-group donor? Let us compare the standard 
free energy of hydrolysis of ATP with that of a phosphate ester, such as 
glycerol 3-phosphate: 








ATP + H20^ 


Coupled Reactions 


ADP + Pi + H+ 

AG°' = —7.3 kcal moP^ ( — 30.5 kj moP^) 

Glycerol 3-phosphate + H 2 O ^ glycerol + Pi 

AG°' = — 2.2 kcal mol“^ ( — 9.2 kj mol“^) 

The magnitude of AG°' for the hydrolysis of glycerol 3-phosphate is much 
smaller than that of ATP, which means that ATP has a stronger tendency 
to transfer its terminal phosphoryl group to water than does glycerol 3-phos¬ 
phate. In other words, ATP has a higher phosphoryl transfer potential 
(phosphoryl-group transfer potential) than does glycerol 3-phosphate. 

What is the structural basis of the high phosphoryl transfer potential of 
ATP? Because AG°' depends on the difference in free energies of the prod- 
ucts and reactants, the structures of both ATP and its hydrolysis products, 
ADP and Pi, must be examined to answer this question. Three factors are 
important: resonance stahilization, electrostatic repulsion, and stabilization 
due to hydration. ADP and, particularly. Pi, have greater resonance stabi¬ 
lization than does ATP. Orthophosphate has a number of resonance forms 
of similar energy (Figure 14.4), whereas the 7 -phosphoryl group of ATP 
has a smaller number. 
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FIGURE 14.4 Resonance structures of orthophosphate. 

Forms like that shown in Figure 14.5 are unfavorable because a positively 
charged oxygen atom is adjacent to a positively charged phosphorus atom, 
an electrostatically unfavorable juxtaposition. Furthermore, at pH 7, the 
triphosphate unit of ATP carries about four negative charges. These charges 
repel one another because they are in dose proximity. The repulsion be- 
tween them is reduced when ATP is hydrolyzed. Finally, water can bind 
more effectively to ADP and Pi than it can to the phosphoanhydride part 
of ATP, stabilizing the ADP and Pi by hydration. 

ATP is often called a high-energy phosphate compound, and its phospho¬ 
anhydride bonds are referred to as high-energy bonds. Indeed, a ‘‘squiggle’' 
(~P) is often used to indicate such a bond. Nonetheless, there is nothing 
special about the bonds themselves. They are high-energy bonds in the sense 
that much free energy is released when they are hydrolyzed, for the aforegiven 
reasons. 

14.1.5 Phosphoryl Transfer Potential Is an Important Form 
of Cellular Energy Transformation 

The standard free energies of hydrolysis provide a convenient means of com- 
paring the phosphoryl transfer potential of phosphorylated compounds. 
Such comparisons reveal that ATP is not the only compound with a high 
phosphoryl transfer potential. In fact, some compounds in biological Sys¬ 
tems have a higher phosphoryl transfer potential than that of ATP. These 
compounds include phosphoenolpyruvate (PEP), 1,3-bisphosphoglycerate 
(1,3-BPG), and creatine phosphate (Figure 14.6). Thus, PEP can transfer 
its phosphoryl group to ADP to form ATP. Indeed, this is one of the ways 
in which ATP is generated in the breakdown of sugars (Sections 14.2.1, 
16.1.6, and 16.1.7). It is significant that ATP has a phosphoryl transfer po¬ 
tential that is intermediate among the biologically important phosphory¬ 
lated molecules (Table 14.1). This intermediate position enables ATP tofunc- 
tion efficiently as a carrier of phosphoryl groups. 


FIGURE 14.5 Improbable resonance 
structure. The structure contributes littie 
to the terminal part of ATP, because two 
positive charges are placed adjacent to 
each other. 






Phosphoenolpyruvate 



Creatine phosphate 






1,3-Bisphosphoglycerate 


FIGURE 14.6 High phosphoryl transfer 
potential compounds. These compounds 
have a higher phosphoryl transfer potential 
than that of ATP and can be used to 
phosphorylate ADP to form ATP. 
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FIGURE 14.7 Sources of ATP during 
exercise. In the initial seconds, exercise 
is powered by existing high phosphoryl 
transfer compounds (ATP and creatine 
phosphate). Subsequentiy, the ATP must 
be regenerated by metabolic pathways. 


I- 

TABLE 14.1 Standard free energies of hydrolysis of some phosphorylated 


compounds 


Compound 

kcal mol ^ 

kJ mol ^ 

Phosphoenolpyruvate 

-14.8 

-61.9 

1,3-Bisphosphoglycerate 

-11.8 

-49.4 

Creatine phosphate 

-10.3 

-43.1 

ATP (to ADP) 

- 7.3 

-30.5 

Glucose 1-phosphate 

- 5.0 

-20.9 

Pyrophosphate 

- 4.6 

-19.3 

Glucose 6-phosphate 

- 3.3 

-13.8 

Glycerol 3-phosphate 

- 2.2 

- 9.2 


Creatine phosphate in vertebrate muscle serves as a reservoir of high- 
potential phosphoryl groups that can be readily transferred to ATP. Indeed, 
we use creatine phosphate to regenerate ATP from ADP every time we ex¬ 
ercise strenuously. This reaction is catalyzed by creatine kinase. 


Creatine kinase 

Creatine phosphate + ADP + v " ATP + creatine 


At pH 7, the standard free energy of hydrolysis of creatine phosphate is 
— 10.3 kcal moP^ ( — 43.1 kj moP^), compared with —7.3 kcal moP^ 
( — 30.5 kj moP^) for ATP. Hence, the standard free-energy change inform- 
ing ATP from creatine phosphate is —3.0 kcal moP^ ( — 12.6 kJ moP^), 
which corresponds to an equilibrium constant of 162. 


[ATP] [creatine] 
[ADP] [creatinephosphate] 


|q-AG°7(2.303RT) ^ IQ3/1.36 ^ |52 


In resting muscle, typical concentrations of these metabolites are [ATP] 
= 4 mM, [ADP] = 0.013 mM, [creatine phosphate] = 25 mM, and [crea¬ 
tine] = 13 mM. The amount of ATP in muscle suffices to sustain contrac- 
tile activity for less than a second. The abundance of creatine phosphate and 
its high phosphoryl transfer potential relative to that of ATP make it a highly 
effective phosphoryl buffer. Indeed, creatine phosphate is the major source 
of phosphoryl groups for ATP regeneration for a runner during the first 4 
seconds of a 100-meter sprint. After that, ATP must be generated through 
metabolism (Figure 14.7). 



14.2 THE OXIDATION OF CARBON FUELS IS AN 
IMPORTANT SOURCE OF CELLULAR ENERGY 


ATP serves as the principal immediate donor of free energy in biological Sys¬ 
tems rather than as a long-term storage form of free energy. In a typical cell. 
















an ATP molecule is consumed within a minute of its formation. Although 
the total quantity of ATP in the body is limited to approximately 100 g, the 
turnover of this small quantity of ATP is very high. For example, a resting 
human being consumes about 40 kg of ATP in 24 hours. During strenuous 
exertion, the rate of utilization of ATP may be as high as 0.5 kg/minute. 
For a 2-hour run, 60 kg (132 pounds) of ATP is utilized. Clearly, it is vital 
to have mechanisms for regenerating ATP. Motion, active transport, signal 
amplification, and biosynthesis can occur only if ATP is continually re- 
generated from ADP (Figure 14.8). The generation of ATP is one of the 
primary roles of catabolism. The carbon in fuel molecules—such as glucose 
and fats—is oxidized to CO 2 , and the energy released is used to regenerate 
ATP from ADP and Pi. 

In aerobic organisms, the ultimate electron acceptor in the oxidation of 
carbon is O 2 and the oxidation product is CO 2 . Consequently, the more re- 
duced a carbon is to begin with, the more exergonic its oxidation will be. 
Figure 14.9 shows the AG°' of oxidation for one-carbon compounds. 


Motion 

Active transport 
Biosyntheses 
Signal amplification 



Oxidation of fuel 
molecules 
or 

Photosynthesis 


FIGURE 14.8 ATP-ADP cycle. This cycle 
is the fundamental mode of energy 
exchange in biological Systems. 
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FIGURE 14.9 Free energy of oxidation of single-carbon compounds. 


Although fuel molecules are more complex (Figure 14.10) than the single- 
carbon compounds depicted in Figure 14.9, when a fuel is oxidized, the 
oxidation takes place one carbon at a time. The carbon oxidation energy is 
used in some cases to create a compound with high phosphoryl transfer po- 
tential and in other cases to create an ion gradient. In either case, the end 
point is the formation of ATP. 



Glucose Fatty acid 


FIGURE 14.10 Prominent fuels. Fats 
are a more efficient fuel source than 
carbohydrates such as glucose because 
the carbon in fats is more reduced. 


14.2.1 High Phosphoryl Transfer Potential Compounds Can 
Couple Carbon Oxidation to ATP Synthesis 

How is the energy released in the oxidation of a carbon compound converted 
into ATP? As an example, consider glyceraldehyde 3-phosphate (shown in 
the margin), which is a metabolite of glucose formed in the oxidation of that 
sugar. The C-1 carbon (shown in red) is a component of an aldehyde and is 
not in its most oxidized State. Oxidation of the aldehyde to an acid will re- 
lease energy. 


H, 



H—C—OH 


CH20P03^“ 


Glyceraldehyde 3-phosphate 
(GAP) 
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CH 2 OPO 32 - CH20P03^" 

Glyceraldehyde 3-phosphate 3-Phosphoglyceric acid 




H—C—OH 




.OH 


Oxidation 


H—C—OH 


However, the oxidation does not take place directly. Instead, the carbon ox¬ 
idation generates an acyl phosphate, 1,3-bisphosphoglycerate. The elec- 
trons released are captured by NAD^, which we will consider shortly. 


H—C OH + NAD+ + HP04^- 

CH20P03^" 

Glyceraldehyde 3-phosphate 
(GAP) 


O. .OPO 32 - 

H—C—OH + NADH + H+ 

I 

CHzOPOs^" 

1,3-Bisphosphoglycerate 
(1,3-BPG) 


For reasons similar to those discussed for ATP (Section 14.1.4), 1,3- 
bisphosphoglycerate has a high phosphoryl transfer potential. Thus, the 
cleavage of 1,3-BPG can be coupled to the synthesis of ATP. 



H—C-OH + ADP 

CH20P03^" 

1,3-Bisphosphoglycerate 


O. .OH 

H—C—OH + ATP 

CHjOPOs^- 
3-Phosphoglyceric acid 


The energy of oxidation is initially trapped as a high-energy phosphate com- 
pound and then used toform ATP. The oxidation energy of a carbon atom is 
transformed into phosphoryl transfer potential, first as 1,3-bisphospho¬ 
glycerate and ultimately as ATP. We will consider these reactions in mech- 
anistic detail in Section 16.1.5. 



FIGURE 14.11 Proton gradíents. The 

oxidation of fuels can power the formation 
of proton gradients. These proton gradients 
can in turn drive the synthesis of ATP. 


14.2.2 lon Gradients Across Membranes Provide an Important 
Form of Cellular Energy That Can Be Coupled to ATP Synthesis 

The electrochemical potential of ion gradients across membranes, produced 
by the oxidation of fuel molecules or by photosynthesis, ultimately powers 
the synthesis of most of the ATP in cells. In general, ion gradients are versa- 
tile means of coupling thermodynamically unfavorable reactions to favorable 
ones. Indeed, in animais, proton gradients generated by the oxidation of car¬ 
bon fuels account for more than 90% of ATP generation (Figure 14.11). This 
process is called oxidative phosphorylation (Chapter 18). ATP hydrolysis 
can then be used to form ion gradients of different types and functions. The 
electrochemical potential of a Na^ gradient, for example, can be tapped to 
pump Ca^^ out of cells (Section 13.4) or to transport nutrients such as sug- 
ars and amino acids into cells. 

14.2.3 Stages in the Extraction of Energy from Foodstuffs 

Let us take an overall view of the processes of energy conversion in higher 
organisms before considering them in detail in subsequent chapters. Hans 
Krebs described three stages in the generation of energy from the oxidation 
of foodstuffs (Figure 14.12). 

In the first stage, large molecules infood are broken down into smaller units. 
Proteins are hydrolyzed to their 20 kinds of constituent amino acids, poly- 











saccharides are hydrolyzed to simple sugars such as glucose, 
and fats are hydrolyzed to glycerol and fatty acids. This stage 
is strictly a preparation stage; no useful energy is captured in 
this phase. 

In the second stage, these numerous small molecules are de- 
graded to a few simple units that play a central role in metab- 
olism. In fact, most of them—sugars, fatty acids, glycerol, and 
several amino acids—are converted into the acetyl unit of 
acetyl Co A (Section 14.3.1). Some ATP is generated in this 
stage, but the amount is small compared with that obtained 
in the third stage. 

In the third stage, ATP is produced from the complete oxi- 
dation of the acetyl unit of acetyl CoA. The third stage consists 
of the citric acid cycle and oxidative phosphorylation, which 
are the final common pathways in the oxidation offuel molecules. 

Acetyl CoA brings acetyl units into the citric acid cycle [also 
called the tricarboxylic acid (TCA) cycle or Krebs cycle], 
where they are completely oxidized to CO 2 . Four pairs of elec- 
trons are transferred (three to NAD^ and one to FAD) for 
each acetyl group that is oxidized. Then, a proton gradient is 
generated as electrons flow from the reduced forms of these 
carriers to O 2 , and this gradient is used to synthesize ATP. FIGURE 14.12 Stages of catabolísm. The extraction of 

energy from fuels can be divided into three stages. 



14.3 METABOLIC PATHWAYS CONTAIN MANY 
RECURRING MOTIFS 

At first glance, metabolism appears intimidating because of the sheer num- 
ber of reactants and reactions. Nevertheless, there are unifying themes that 
make the comprehension of this complexity more manageable. These uni¬ 
fying themes include common metabolites, reactions, and regulatory schemes 
that stem from a common evolutionary heritage. 


14.3.1 Activated Carriers Exemplify the Modular Design and 
Economy of Metabolism 

We have seen that phosphoryl transfer can be used to drive otherwise en- 
dergonic reactions, alter the energy of conformation of a protein, or serve as 
a signal to alter the activity of a protein. The phosphoryl-group donor in all 
of these reactions is ATP. In other words, ATP is an activated carrier of 
phosphoryl groups because phosphoryl transfer from ATP is an exergonic 
process. The use of activated carriers is a recurring motif in biochemistry, 
and we will consider several such carriers here. 

1. Activated carriers of electrons for fuel oxidation. In aerobic organisms, the 
ultimate electron acceptor in the oxidation of fuel molecules is O 2 . How- 
ever, electrons are not transferred directly to O 2 . Instead, fuel molecules 
transfer electrons to special carriers, which are either pyridine nucleotides 
or fiavins. The reduced forms of these carriers then transfer their high- 
potential electrons to O 2 . 

Nicotinamide adenine dinucleotide is a major electron carrier in the ox¬ 
idation of fuel molecules (Figure 14.13). The reactive part of NAD^ is its 
nicotinamide ring, a pyridine derivative synthesized from the vitamin 
niacin. In the oxidation of a substrate, the nicotinamide ring of NAD^ ac- 
cepts a hydrogen ion and two electrons, which are equivalent to a hydride ion. 


Reactive site 



FIGURE 14.13 Structures of the 
oxidized forms of nicotinamide-derived 
electron carriers. Nicotinamide adenine 
dinucleotide (NAD^) and nicotinamide 
adenine dinucleotide phosphate (NADPA 
are prominent carriers of high-energy 
electrons. In NAD+, R = H; in NADP+, 

R = P03^“. 
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FIGURE 14.14 Structure of the oxidízed 
form of flavin adeníne dínucleotíde 
(FAD). This electron carrier consists of a 
flavin mononudeotide (FMN) unit (shown 
in blue) and an AMP unit (shown in black). 



The reduced form of this carrier is called NADH. In the oxidized form, the 
nitrogen atom carries a positive charge, as indicated by NAD^. NAD^ is 
the electron acceptor in many reactions of the type 

OH O 

.C + NAD+ , .C + NADH + H+ 

\^R' R^ ^R' 

H 

In this dehydrogenation, one hydrogen atom of the substrate is directly 
transferred to NAD^, whereas the other appears in the solvent as a proton. 
Both electrons lost by the substrate are transferred to the nicotinamide ring. 

The other major electron carrier in the oxidation of fuel molecules is the 
coenzyme flavin adenine dinucleotide (Figure 14.14). The abbreviations for 
the oxidized and reduced forms of this carrier are FAD and FADH 2 , re- 
spectively. FAD is the electron acceptor in reactions of the type 


H H 

^ A 

H H 


+ FAD 


R R' 

+ FADH 2 

H H 



The reactive part of FAD is its isoalloxazine ring, a derivative of the vita- 
min riboflavin (Figure 14.15). FAD, like NAD^, can accept two electrons. 
In doing so, FAD, unlike NAD^, takes up two protons. These carriers of 
high-potential electrons as well as flavin mononudeotide (FMN), an elec¬ 
tron carrier related to FAD, will be considered further in Chapter 18. 

2. An activated carrier of electrons for reductive biosynthesis. High-potential 
electrons are required in most biosyntheses because the precursors are more 
oxidized than the products. Hence, reducing power is needed in addition to 


FIGURE 14.15 Structures of 
the reactive parts of FAD 
and FADH 2 . The electrons and 
protons are carried by the 
isoalloxazine ring component of 
FAD and FADH 2 . 



Oxidized form 
(FAD) 



Reduced form 
(FADH 2 ) 
















ATP. For example, in the biosynthesis of fatty acids, the keto group of an 
added two-carbon unit is reduced to a methylene group in several steps. 
This sequence of reactions requires an input of four electrons. 



+ 4 e" 



+ H2O 


The electron donor in most reductive biosyntheses is NADPH, the reduced 
form of nicotinamide adenine dinucleotide phosphate (NADP^; see Figure 
14.13). NADPH differs from NADH in that the 2'-hydroxyl group of its 
adenosine moiety is esterified with phosphate. NADPH carries electrons in 
the same way as NADH. However, NADPH is used almost exclusively for 
reductive biosyntheses, whereas NADH is used primarily for the generation of 
ATP. The extra phosphoryl group on NADPH is a tag that enables enzymes 
to distinguish between high-potential electrons to be used in anabolism and 
those to be used in catabolism. 
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3. An activated carrier of two-carbon fragments. Coenzyme A, another cen¬ 
tral molecule in metabolism, is a carrier of acyl groups (Figure 14.16). Acyl 
groups are important constituents both in catabolism, as in the oxidation of 
fatty acids, and in anabolism, as in the synthesis of membrane lipids. The ter¬ 
minal sulfhydryl group in Co A is the reactive site. Acyl groups are linked to 
Co A by thioester bonds. The resulting derivative is called an acyl CoA. An 
acyl group often linked to CoA is the acetyl unit; this derivative is called acetyl 
CoA. The AG°' for the hydrolysis of acetyl CoA has a large negative value: 

Acetyl CoA + H 2 O acetate + CoA + H^ 

AG°' = —7.5 kcal moD^ ( — 31.4 kj moD^) 

The hydrolysis of a thioester is thermodynamically more favorable than that 
of an oxygen ester because the electrons of the C=0 bond cannot form res- 
onance structures with the C-S bond that are as stable as those that they 
can form with the C-O bond. Consequently, acetyl CoA has a high acetyl 
potential (acetyl group-transfer potential) because transfer of the acetyl group 
is exergonic. Acetyl CoA carries an activated acetyl group, just as ATP car¬ 
ries an activated phosphoryl group. 

The use of activated carriers illustrates two key aspects of metabolism. 
First, NADH, NADPH, and FADH 2 react slowly with O 2 in the absence 
of a catalyst. Likewise, ATP and acetyl CoA are hydrolyzed slowly (in 
times of many hours or even days) in the absence of a catalyst. These 
molecules are kinetically quite stable in the face of a large thermodynamic 
driving force for reaction with O 2 (in regard to the electron carriers) and 
H 2 O (in regard to ATP and acetyl CoA). The kinetic stability of these mol¬ 
ecules in the absence of specific catalysts is essential for their biological func- 
tion because it enables enzymes to control the flow of free energy and reduc- 
ing power. 


O O 



Acyl CoA Acetyl CoA 



(3-Mercapto- Pantothenate unit 

ethylamine unit 


FIGURE 14.16 Structure of 
coenzyme A (CoA). 
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TABLE 14.2 Some activated carriers in metabolism 


Carrier molecule in activated form 

Group carried 

Vitamin precursor 

ATP 

Phosphoryl 


NADH and NADPH 

Electrons 

Nicotinate (niacin) 

FADH 2 

Electrons 

Riboflavin (vitamin B 2 ) 

FMNH 2 

Electrons 

Riboflavin (vitamin B 2 ) 

Coenzyme A 

Acyl 

Pantothenate 

Lipoamide 

Acyl 


Thiamine pyrophosphate 

Aldehyde 

Thiamine (vitamin B^) 

Biotin 

GO 2 

Biotin 

Tetrahydrofolate 

One-carbon units 

Folate 

S - Adenosylmethionine 

Methyl 


Uridine diphosphate glucose 

Glucose 


Gytidine diphosphate diacylglycerol 

Phosphatidate 


Nucleoside triphosphates 

Nucleotides 



Note: Many of the activated carriers are coenzymes that are derived from water-soluble 
vitamins (Section 8.6.1). 


Second, most interchanges of activated groups in metabolism are accom- 
plished by a rather small set of carriers (Table 14.2). The existence of a re- 
curring set of activated carriers in all organisms is one of the unifying mo- 
tifs of biochemistry. Furthermore, it illustrates the modular design of 
metabolism. A small set of molecules carries out a very wide range of tasks. 
Metabolism is readily comprehended because of the economy and elegance 
of its underlying design. 

14.3.2 Key Reactions Are Reiterated Throughout 
Metabolism 

Just as there is an economy of design in the use of activated carriers, so is 
there an economy of design in biochemical reactions. The thousands of meta- 
bolic reactions, bewildering at first in their variety, can be subdivided into 
just six types (Table 14.3). Specific reactions of each type appear repeatedly, 
further reducing the number of reactions necessary for the student to learn. 


r 


TABLE 14.3 Types of Chemical reactions in metabolism 


Type of reaction 

Description 

Oxidation-reduction 

Electron transfer 

Ligation requiring ATP 

Formation of covalent bonds (i.e., carbon- 

cleavage 

carbon bonds) 

Isomerization 

Rearrangement of atoms to form isomers 

Group transfer 

Transfer of a functional group from one 
molecule to another 

Hydrolytic 

Gleavage of bonds by the addition of water 

Addition or removal of 

Addition of functional groups to double 

functional groups 

bonds or their removal to form double 
bonds 














1. Oxidation—reduction reactions are essential components of many path- 
ways. Useful energy is often derived from the oxidation of carbon com- 
pounds. Consider the following two reactions: 


Recurring Motifs 


O 

- II H2 

O X + FAD 

H2 |: - 

ó 

Succinate 


O 






+ NAD-" 


Malate 


1 



+ FADH 2 

1 

H 

0 



Fumarate 




) 

1 H 2 





x^ 

_ 

+ NADH + H 

0 

0 



Oxaloacetate 




0 ) 


( 2 ) 


These two oxidation-reduction reactions are components of the citric acid 
cycle (Chapter 17), which completely oxidizes the activated two-carbon 
fragment of acetyl Co A to two molecules of CO 2 . In reaction 1, FADH 2 
carries the electrons, whereas, in reaction 2, electrons are carried by NADH. 
In biosynthetic oxidation-reduction reactions, NADPH is the reductant. 

2. Ligation reactions form bonds by using free energy from ATP cleavage. 
Reaction 3 illustrates the ATP-dependent formation of a carbon-carbon 
bond, necessary to combine smaller molecules to form larger ones. Ox- 
aloacetate is formed from pyruvate and CO 2 . 



The oxaloacetate can be used in the citric acid cycle or converted into amino 
acids such as aspartic acid. 

3. Isomerization reactions rearrange particular atoms within the molecule. 
Their role is often to prepare a molecule for subsequent reactions such as 
the oxidation—reduction reactions described in point 1. 



Citrate Isocitrate 


Reaction 4 is, again, a component of the citric acid cycle. This isomeriza¬ 
tion prepares the molecule for subsequent oxidation and decarboxylation 
by moving the hydroxyl group of citrate from a tertiary to a secondary 
position. 

4. Group-transfer reactions play a variety of roles. Reaction 5 is representa¬ 
tivo of such a reaction. A phosphoryl group is transferred from the activated 
phosphoryl-group carrier, ATP, to glucose. This reaction traps glucose in 
the cell so that further catabolism can take place. 
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Glucose 


ATP 



+ 0 o 



adenine 


(5) 


Glucose 6-phosphate 
(G-6P) 


ADP 


We saw earlier that group-transfer reactions are used to synthesize ATP 
(14.2.1). We will also see examples of their use in signaling pathways (Chap- 
ter 15). 

5. Hydrolytic reactions cleave bonds by the addition of water. Hydrolysis is 
a common means employed to break down large molecules, either to facil- 
itate further metabolism or to reutilize some of the components for biosyn- 
thetic purposes. Proteins are digested by hydrolytic cleavage (Chapters 9 
and 10). Reaction 6 illustrates the hydrolysis of a peptide to yield two smaller 
peptides. 



6. The addition of functional groups to double bonds or the removal of groups 
to form double bonds constitutes the final class of reactions. The enzymes 
that catalyze these types of reaction are classified as lyases (Section 8.1.3). 
An important example, illustrated in reaction 7, is the conversion of the six- 
carbon molecule fructose 1,6-bisphosphate (F-1, 6-BP) into 2 three-carbon 
fragments: dihydroxyacetone phosphate and glyceraldehyde 3-phosphate. 


^CH20P03^“ 


HO- 

H- 


H- 


-H 

-OH 

-OH 


CH20P03^ 


Fructose 1,6-bisphosphate 
(F-1,6-BP) 




XH2OPO32 


HO- 


-OH 

CHzOPOs^" 


(7) 


Dihydroxyacetone phosphate 
(DHAP) 


Glyceraldehyde 3-phosphate 
(GAP) 


This reaction is a key step in glycolysis, a key pathway for extracting en- 
ergy from glucose (Section 16.1.5). Dehydrations to form double bonds, 






















such as the formation of phosphoenolpyruvate (Table 14.1) from 2-phos- 
phoglycerate (reaction 8), are important reactions of this type. 


Recurring Motifs 



2-Phosphoglycerate Phosphoenolpyruvate 

(PEP) 


The dehydration sets up the next step in the pathway, a group-transfer re¬ 
action that uses the that uses the high phosphoryl transfer potential of the 
product PEP to form ATP from ADR 

These six fundamental reaction types are the basis of metabolism. Remem- 
ber that all six types can proceed in either direction, depending on the stan¬ 
dard free energy for the specific reaction and the concentration of the reac- 
tants and products inside the cell. As an example of how simple themes are 
reiterated, consider the reactions shown in Figure 14.17. The same sequence 
of reactions is employed in the citric acid cycle, fatty acid degradation, the 
degradation of the amino acid lysine, and (in reverse) the biosynthesis of 
fatty acids. An effective way to learn is to look for commonalties in the di- 
verse metabolic pathways that we will be studying. There is a Chemical logic 
that, when exposed, renders the complexity of the chemistry of living Sys¬ 
tems more manageable and reveals its elegance. 


Citric acid cycle 


Fatty acid degradation 


Fatty acid synthesis 


Lysine degradation 


H H 


-ooc 



H H 


^ FADH2 



O 


-OOC 



^NADH + m 


COO- 


H H 

0 

H H 


0 


0 = 






HOOCCH 2 -' 




V ^S-CoA 



^S-ACP 



/\ ^SCoA 

H 

H 

H 

H 


H H 


^ FADH 2 



^NADH + H+ 



^ FADH 2 

H 

0 

H 


0 


H 

0 

RCH2^^^'^ 





H00CCH2-^ 




^S-CoA 



^S-ACP 



^SCoA 

H 


H 




H 


/H 2 O 



^H20 



/H 2 O 

HO H 

0 

HO H 


0 

HO 

H 

0 

RCH2-'^^X 





H3C-^ 




V ^S-CoA 



^S-ACP 



^SCoA 

H 

H 

H 

H 


H 1 

H 


^NADH + H+ 



^NADH + H+ 



^NADH + H+ 

0 


0 



0 




0 



0 



0 

RCH2"^^ 

^S-CoA 

H3C^^ 


^S-ACP 

H3C^ 


^SCoA 


HH HH HH HH 

FIGURE 14.17 Metabolic motifs. Some metabolic pathways have similar sequences of 
reactions in common—in this case, an oxidation, the addition of a functional group (from a 
water molecule) to a double bond, and another oxidation. ACP designates acyl carrier 
protein. 
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Energy charge 

FIGURE 14.18 Energy charge regulates 
metabolism. High concentrations of 
ATP inhibit the relative rates of a typical 
ATP-generating (catabolic) pathway and 
stimulate the typical ATP-utilizing (anabolic) 
pathway. 


14.3.3 Metabolic Processes Are Regulated in Three 
Principal Ways 

It is evident that the complex network of reactions constituting intermedi- 
ary metabolism must be rigorously regulated. At the same time, metabolic 
control must be flexible, because the externai environments of cells are not 
constant. Metabolism is regulated by controlling (1) the amounts of enzymes, 
(2) their catalytic activities, and (3) the accessibility of substrates. The amount 
of a particular enzyme depends on both its rate of synthesis and its rate of 
degradation. The levei of most enzymes is adjusted primarily by changing 
the rate of transcription of the genes encoding them. In E. coli, the presence 
of lactose, for example, induces within minutes a more than 50-fold increase 
in the rate of synthesis of P-galactosidase, an enzyme required for the break- 
down of this disaccharide. 

The catalytic activity of enzymes is controlled in several ways. Reversible 
allosteric control is especially important. For example, the first reaction in 
many biosynthetic pathways is allosterically inhibited by the ultimate prod- 
uct of the pathway. The inhibition of aspartate transcarbamoylase by cyti- 
dine triphosphate (Section 10.1) is a well-understood example oí feedback 
inhibition. This type of control can be almost instantaneous. Another re- 
curring mechanism is reversible covalent modification. For example, glyco- 
gen phosphorylase, the enzyme catalyzing the breakdown of glycogen, a 
storage form of sugar, is activated by phosphorylation of a particular serine 
residue when glucose is scarce (Section 21.2.1). 

Hormones coordinate metabolic relations between different tissues, often 
by regulating the reversible modification of key enzymes. Hormones such 
as epinephrine trigger signal transduction cascades that lead to highly am- 
plified changes in metabolic patterns in target tissues such as the muscle 
(Section 15.0.1). The hormone insulin promotes the entry of glucose into 
many kinds of cells. As will be discussed again in Chapter 15, many hor¬ 
mones act through intracellular messengers, such as cyclic AMP and cal- 
cium ion, that coordinate the activities of many target proteins. 

Controlling the^ux of substrates also regulates metabolism. The trans- 
fer of substrates from one compartment of a cell to another (e.g., from the 
cytosol to mitochondria) can serve as a control point. 

An important general principie of metabolism is that biosynthetic and 
degradative pathways are almost always distinct. This separation is necessary 
for energetic reasons, as will be evident in subsequent chapters. It also fa- 
cilitates the control of metabolism. In eukaryotes, metabolic regulation and 
flexibility also are enhanced by compartmentalization. For example, fatty 
acid oxidation takes place in mitochondria, whereas fatty acid synthesis 
takes place in the cytosol. Compartmentalization segregates opposed reactions. 

Many reactions in metabolism are controlled by the energy status of the 
cell. One index of the energy status is the energy charge, which is propor- 
tional to the mole fraction of ATP plus half the mole fraction of ADP, given 
that ATP contains two anhydride bonds, whereas ADP contains one. 
Hence, the energy charge is defined as 

[ATP] +'/j[ADP] 

Energy charge = [^TP] + [ADP] + [AMP] 

The energy charge can have a value ranging from 0 (all AMP) to 1 (all ATP). 
Daniel Atkinson showed that ATP-generating (catabolic) pathways are in¬ 
hibited by an energy charge, whereas ATP-utilizing (anabolic) pathways are 
stimulated by a high-energy charge. In plots of the reaction rates of such path¬ 
ways versus the energy charge, the curves are steep near an energy charge of 
0.9, where they usually intersect (Figure 14.18). It is evident that the control 






of these pathways has evolved to maintain the energy charge within rather 
narrow limits. In other words, the energy charge, like the pH of a cell, is 
buffered. The energy charge of most cells ranges from 0.80 to 0.95. An al¬ 
ternativo index of the energy status is the phosphorylation potential, which 
is defined as 

[ATP] 

Phosphorylation potential = 

The phosphorylation potential, in contrast with the energy charge, depends 
on the concentration of Pi and is directly related to the free energy-storage 
available from ATP. 

14.3.4 Evolution of Metabolic Pathways 

Y How did the complex pathways that constitute metabolism evolve? 

This question, a difficult one to address, was approached in Chap- 
ter 2. The current thinking is that RNA was an early biomolecule and that, 
in an early RNA world, RNA served as catalysts and Information-stor age 
molecules (Section 2.2.2). 

Why do activated carriers such as ATP, NADH, FADH 2 , and coen- 
zyme A contain adenosine diphosphate units (Figure 14.19)? A possible ex- 
planation is that these molecules evolved from the early RNA catalysts. 
Non-RNA units such as the isoalloxazine ring may have been recruited 
to serve as efficient carriers of activated electrons and chemical units, a 
function not readily performed by RNA itself. We can picture the adenine 
ring of FADH 2 binding to a uracil unit in a niche of an RNA enzyme (ri- 
bozyme) by base-pairing, whereas the isoalloxazine ring protrudes and 
functions as an electron carrier. When the more versatile proteins replaced 
RNA as the major catalysts, the ribonucleotide coenzymes stayed essen- 
tially unchanged because they were already well suited to their metabolic 
roles. The nicotin amide unit of NADH, for example, can readily transfer 
electrons irrespective of whether the adenine unit interacts with a base in 
an RNA enzyme or with amino acid residues in a protein enzyme. With the 
advent of protein enzymes, these important cofactors evolved as free mol¬ 
ecules without losing the adenosine diphosphate vestige of their RNA- 
world ancestry. That molecules and motifs of metabolism are common to 
all forms of life testifies to their common origin and to the retention of func- 
tioning modules through billions of years of evolution. Our understanding 
of metabolism, like that of other biological processes, is enriched by inquiry 
into how these beautifully integrated patterns of reactions carne into being. 
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Coenzyme A 


SUMMARY 


All cells transform energy. They extract energy from their environment 
and use this energy to convert simple molecules into cellular components. 

o Metabolism Is Composed of Many Coupled, Interconnecting Reactions 

The process of energy transduction takes place through a highly inte¬ 
grated network of chemical reactions called metabolism. Metabolism can 
be subdivided into catabolism (reactions employed to extract energy 
from fuels) and anabolism (reactions that use this energy for biosynthe- 
sis). The most valuable thermodynamic concept for understanding 
bioenergetics is free energy. A reaction can occur spontaneously only if 
the change in free energy (AG) is negative. A thermodynamically unfa- 
vorable reaction can be driven by a thermodynamically favorable one, 
which in many cases is the hydrolysis of ATP. The hydrolysis of ATP 
shifts the equilibrium of a coupled reaction by a factor of about 10^. ATP, 


FIGURE 14.19 Adenosine diphosphate 
(ADP) is an ancient module in 
metabolism. This fundamental building 
block is present in key molecules such 
as ATP, NADH, FAD, and coenzyme A. 

The adenine unit is shown in blue, the 
ribose unit in red, and the diphosphate 
unit in yellow. 
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the universal currency of energy in biological systems, is an energy-rich 
molecule because it contains two phosphoanhydride bonds. 

• The Oxidation of Carbon Fuels Is an Important Source of Cellular Energy 

ATP formation is coupled to the oxidation of carbon fuels, either di- 
rectly or through the formation of ion gradients. Photosynthetic organ- 
isms can use light to generate such gradients. ATP is consumed in mus- 
cle contraction and other motions of cells, in active transport, in signal 
transduction processes, and in biosyntheses. There are three stages in 
the extraction of energy from foodstuffs by aerobic organisms. In the 
first stage, large molecules are broken down into smaller ones, such as 
amino acids, sugars, and fatty acids. In the second stage, these small 
molecules are degraded to a few simple units that have a pervasive role 
in metabolism. One of them is the acetyl unit of acetyl CoA, a carrier 
of activated acyl groups. The third stage of metabolism is the citric acid 
cycle and oxidative phosphorylation, in which ATP is generated as elec- 
trons flow to O 2 , the ultimate electron acceptor, and fuels are completely 
oxidized to CO 2 . 

• Metabolic Pathways Contain Many Recurring Motifs 

Metabolism is characterized by common motifs. A small number of ac¬ 
tivated carriers, such as ATP, NADH, and acetyl CoA, are used in many 
metabolic pathways. NADPH, which carries two electrons at a high po- 
tential, provides reducing power in the biosynthesis of cell components 
from more-oxidized precursors. ATP and NADPH are continually gen¬ 
erated and consumed. Most transfers of activated groups in metabolism 
are mediated by a recurring set of carriers. Moreover, key reaction types 
are used repeatedly in metabolic pathways. 

Metabolism is regulated in a variety of ways. The amounts of some 
criticai enzymes are controlled by regulation of the rate of protein syn- 
thesis and degradation. In addition, the catalytic activities of many en¬ 
zymes are regulated by allosteric interactions (as in feedback inhibition) 
and by covalent modification. The movement of many substrates into 
cells and subcellular compartments also is controlled. Distinct pathways 
for biosynthesis and degradation contribute to metabolic regulation. 
The energy charge, which depends on the relative amounts of ATP, 
ADP, and AMP, plays a role in metabolic regulation. A high-energy 
charge inhibits ATP-generating (catabolic) pathways, whereas it stim- 
ulates ATP-utilizing (anabolic) pathways. 
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phototroph (p. 374) 
chemotroph (p. 374) 
metabolism or intermediary metabolism 
(p. 374) 

catabolism (p. 374) 
anabolism (p. 374) 
amphibolic pathway (p. 375) 


coupled reaction (p. 377) 
phosphoryl transfer potential (p. 379) 
oxidative phosphorylation (p. 382) 
activated carrier (p. 383) 
oxidation-reduction reaction (p. 387) 
ligation reaction (p. 387) 
isomerization reaction (p. 387) 


group-transfer reaction (p. 387) 
hydrolytic reaction (p. 388) 
addition to or formation of double-bond 
reaction (p. 388) 
energy charge (p. 390) 
phosphorylation potential (p. 391) 
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-1 PROBLEMS I- 

1. Energy flow. What is the direction of each of the foliowing 
reactions when the reactants are initially present in equimolar 
amounts? Use the data given in Table 14.1. 

(a) ATP + creatine ^ " creatine phosphate + ADP 

(b) ATP + glycerol^^-^glycerol 3-phosphate + ADP 

(c) ATP + pyruvate ^^ phosphoenolpyruvate + ADP 

(d) ATP + glucose -^glucose 6-phosphate + ADP 

2. A proper inference. What Information do the AG ° ' data given 
in Table 14.1 provide about the relative rates of hydrolysis of py- 
rophosphate and acetyl phosphate? 

3. A potent donor. Consider the following reaction: 

ATP + pyruvate ^^ phosphoenolpyruvate + ADP 

(a) Calculate AG°' and K'eq at 25°C for this reaction, by using 
the data given in Table 14.1. 

(b) What is the equilibrium ratio of pyruvate to phospho¬ 
enolpyruvate if the ratio of ATP to ADP is 10? 

4. Isomeric equilibrium. Calculate AG°' for the isomerization of 
glucose 6-phosphate to glucose 1-phosphate. What is the equilib¬ 
rium ratio of glucose 6-phosphate to glucose 1 -phosphate at 25°C? 

5. Activated acetate. The formation of acetyl Co A from acetate 
is an ATP-driven reaction: 

Acetate + ATP + CoA ^ - acetyl CoA + AMP + PP^ 


(a) Calculate AG°' for this reaction by using data given in this 
chapter. 

(b) The PPi formed in the preceding reaction is rapidly hy- 
drolyzed in vivo because of the ubiquity of inorganic pyrophos- 
phatase. The AG°' for the hydrolysis of PPi is —4.6 kcal moU^. 
Calculate the AG°' for the overall reaction. What effect does the 
hydrolysis of PPi have on the formation of acetyl CoA? 

6. Acid strength. The pK of an acid is a measure of its proton- 
group-transfer potential. 

(a) Derive a relation between AG°' and pK. 

(b) What is the AG°' for the ionization of acetic acid, which has 
a pK of 4.8? 

7. Raison d’être. The muscles of some invertebrates are rich 
in arginine phosphate (phosphoarginine). Propose a function 
for this amino acid derivative. How would you test your 
hypothesis? 



Arginine phosphate 
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8. Recurring motif. What is the structural feature common to 
ATP, FAD, NAD+, and CoA? 

9. Ergogenic help or hindrance? Creatine is a popular, but 
untested, dietary supplement. 

(a) What is the biochemical rationale for the use of creatine? 

(b) What type of exercise would most benefit from creatine 
supplementation? 

10. Standard conditions versus real life. The enzyme aldolase cat- 
alyzes the foliowing reaction in the glycolytic pathway: 

Aldolase 

Fructose 1, 6 bisphosphate- 

dihydroxyacetone phosphate + glyceraldehyde 3-phosphate 

The AG°' for the reaction is +5.7 kcal moP^, whereas the AG 
in the cell is —0.3 kcal moP^. Calculate the ratio of reactants to 
Products under equilibrium and intracellular conditions. Using 
your results, explain how the reaction can be endergonic under 
standard conditions and exergonic under intracellular condi¬ 
tions. 

11. Not all alike. The concentrations of ATP, ADP, and Pi dif- 
fer with cell type. Consequently, the release of free energy with 
the hydrolysis of ATP will vary with cell type. Using the fol¬ 
io wing table, calculate the AG for the hydrolysis of ATP in mus- 
cle, liver, and brain cells. In which cell type is the free energy of 
ATP hydrolysis greatest? 



ATP (mM) 

ADP (mM) 

Pi (mM) 

Liver 

3.5 

1.8 

5.0 

Muscle 

8.0 

0.9 

8.0 

Brain 

2.6 

0.7 

2.7 


12. Running downhill. Glycolysis is a series of 10 linked reac- 
tions that convert one molecule of glucose into two molecules of 
pyruvate with the concomitant synthesis of two molecules of 
ATP (Chapter 16). The AG°' for this set of reactions is —8.5 


kcal moU^ ( — 35.6 kj moU^), whereas the AG is —18.3 kcal 
moU^ ( — 76.6 kJ moU^). Explain why the free-energy release is 
so much greater under intracellular conditions than under stan¬ 
dard conditions. 

Chapter Integration Problem 

13. Activated sulfate. Fibrinogen contains tyrosine-O-sulfate. 
Propose an activated form of sulfate that could react in vivo with 
the aromatic hydroxyl group of a tyrosine residue in a protein to 
form tyrosine-O-sulfate. 

Data Interpretation 

14. Opposites attract. The following graph shows how the AG 
for the hydrolysis of ATP varies as a function of the Mg^^ con- 
centration (pMg = log l/[Mg^^]). 



(a) How does decreasing [Mg^^] affect the AG of hydrolysis for 
ATP? 

(b) How can you explain this effect? 

Media Problem 

15. Coupled reactions. The most obvious role of enzymes is to 
accelerate reactions, but a second criticai role is to couple reac¬ 
tions that would ordinarily be unrelated. From what you have 
learned from the text and the Conceptual Insights module on 
energetic coupling, explain why the coupling of chemical reac¬ 
tions in a single enzyme is criticai for life. 
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Glycolysis and Gluconeogenesis 




^ Glucose j 


2 Pyruvate 


0 ^ 


Glycolysis produces energy. Michael Johnson sprints to another 
victory in the 200-nneter semifinais of the 1996 Atlantic Olympics. 
Johnson, like anyone who sprints, requires a source of energy that can 
be rapidiy accessed. The anaerobic metabolism of glucose—the process 
of glycolysis—provides such a source of energy for short, intense bouts 
of exercise. [Simon Bruty/Allsport] 


The first metabolic pathway that we encounter is glycolysis, an ancient 
pathway employed by a host of organisms. Glycolysis is the sequence of re- 
actions that metabolizes one molecule of glucose to two molecules of pyruvate 
with the concomitant net production of two molecules of ATP. This process is 
anaerobic (i.e., it does not require O 2 ) inasmuch as it evolved before the ac- 
cumulation of substantial amounts of oxygen in the atmosphere. Pyruvate 
can be further processed anaerobically (fermented) to lactate (lactic acidfer- 
mentation) or ethanol {alcoholic fermentation). Under aerobic conditions, 
pyruvate can be completely oxidized to CO 2 , generating much more ATP, 
as will be discussed in Chapters 18 and 19. 

Glucose can be synthesized from noncarbohydrate pre- 
cursors, such as pyruvate and lactic acid, in the process of 
gluconeogenesis. Although glycolysis and gluconeogenesis 
have some of the same enzymes in common, the two path- 
ways are not simply the reverse of each other. In particular, 
the highly exergonic, irreversible steps of glycolysis are by- 
passed in gluconeogenesis. Both pathways are stringently 
controlled by intercellular and intracellular signals, and 
they are reciprocally regulated so that glycolysis and gluco¬ 
neogenesis do not take place simultaneously in the same cell 
to a significant extent. 

Our understanding of glucose metabolism, especially 
glycolysis, has a rich history. Indeed, the development of 
biochemistry and the delineation of glycolysis went hand in 
hand. A key discovery was made by Hans Buchner and 
Eduard Buchner in 1897, quite by accident. The Buchners 


OUTLINE 


O 16.1 Glycolysis Is an Energy-Conversion 
Pathway in Many Organisms 

o 16.2 The Glycolytic Pathway Is Tightly 
Controlled 

o 16.3 Glucose Can Be Synthesized 
from Noncarbohydrate Precursors 

o 16.4 Gluconeogenesis and Glycolysis 
Are Reciprocally Regulated 
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Glycolysis— 

Derived from the Greek stem glyk-, "sweet," 
and the word lysis, "dissolution." 



















-1 16-2 I- 

CHAPTER 16 • Glycolysis and 
Gluconeogenesis 


I- 

Enzyme— 

A term coined by Friedrich Whilhelm 
Kühne in 1878 to designate catalytically 
active substances that had previousiy 
been called ferments. Derived from the 
Greek workds en, "in," and zyme, 
"leaven." 


I- 

Fermentation— 

An ATP-generating process in which 
organic compounds act as both donors 
and acceptors of electrons. Fermenta- 
tion can take place in the absence of 
O 2 . Discovered by Louis Pasteur, who 
described fermentation as "la vie sans 
Tair" ("life without air"). 


were interested in manufacturing cell-free extracts of yeast for possible ther- 
apeutic use. These extracts had to be preserved without the use of antisep- 
tics such as phenol, and so they decided to try sucrose, a commonly used 
preservative in kitchen chemistry. They obtained a startling result: sucrose 
was rapidly fermented into alcohol by the yeast juice. The significance of 
this finding was immense. The Buchners demonstrated for the first time that 
fermentation could take place outside living cells. The accepted view of their 
day, asserted by Louis Pasteur in 1860, was that fermentation is inextrica- 
bly tied to living cells. The chance discovery of the Buchners refuted this 
vitalistic dogma and opened the door to modern biochemistry. Metabolism 
became chemistry. 

Studies of muscle extracts then showed that many of the reactions of lac- 
tic acid fermentation were the same as those of alcoholic fermentation. This 
exciting discovery revealed an underlying unity in biochemistry. The complete 
glycolytic pathway was elucidated by 1940, largely through the pioneering 
contributions of Gustav Embden, Otto Meyerhof, Cari Neuberg, Jacob 
Parnas, Otto Warburg, Gerty Cori, and Cari Cori. Glycolysis is also known 
as the Embden-Meyerhof pathway. 

In our consideration of the glycolytic and gluconeogenic pathways, we 
shall examine the mechanisms of selected enzymes in some detail. Of par¬ 
ticular interest will be the enzymes that play the most central roles in con- 
verting one type of chemical energy into another. 

16.0.1 Glucose Is an Important Fuel for Most Organisms 

Y Glucose is an important and common fuel. In mammals, glucose is 
the only fuel that the brain uses under nonstarvation conditions and 
the only fuel that red blood cells can use at all. Indeed, almost all organisms 
use glucose, and most that do process it in a similar fashion. Recall from 
Chapter 11 that there are many carbohydrates. Why is glucose instead of 
some other monosaccharide such a prominent fuel? We can speculate on the 
reasons. First, glucose is one of the monosaccharides formed from formalde- 
hyde under prebiotic conditions, so it may have been available as a fuel source 
for primitive biochemical systems. Second, glucose has a low tendency, rel- 
ative to other monosaccharides, to nonenzymatically glycosylate proteins. In 
their open-chain (carbonyl) forms, monosaccharides can react with the amino 
groups of proteins to form Schiff bases, which rearrange to form a more sta- 
ble amino ketone linkage. Such nonspecifically modified proteins often do 
not function effectively. Glucose has a strong tendency to exist in the ring 
formation and, consequently, relatively little tendency to modify proteins. 
Recall that all the hydroxyl groups in the ring conformation of j8-glucose are 
equatorial, contributing to this high relative stability (Section ll.x.x). 

16.0.2 Fermentations Provide Usable Energy in the 
Absence of Oxygen 

Although glycolysis is a nearly universal process, the fate of its end prod- 
uct, pyruvate, may vary in different organisms or even in different tissues. 
In the presence of oxygen, the most common situation in multicellular or¬ 
ganisms and many unicellular ones, pyruvate is metabolized to carbon diox- 
ide and water through the citric acid cycle and the electron-transport chain. 
In the absence of oxygen, fermentation generates a lesser amount of energy; 
pyruvate is converted, or fermented, into lactic acid in lactic acid fermen¬ 
tation or into ethanol in alcoholic fermentation (Figure 16.1). Lactic acid 
production takes place in skeletal muscle when energy needs outpace the 
ability to transport oxygen. Although we will consider only these two fer- 











FERMENTATION COMPLETE 



mentations, microorganisms are capable of generating a 
wide array of molecules as end points to fermentation 
(Table 16.1). Indeed, many food products are the result 
of fermentations. These foods include sour cream, 
yogurt, various cheeses, beer, wine, and sauerkraut. 

Fermentations yield only a fraction of the energy 
available from the complete combustion of glucose. Why 
is a relatively inefficient metabolic pathway so exten- 
sively used? The fundamental reason is that oxygen is not 
required. The ability to survive without oxygen affords 
a host of living accommodations such as soils, deep wa- 
ter, and skin pores. Some organisms, called obligate 
anaerobes, cannot survive in the presence of O 2 , a highly 
reactive compound. The bacterium Clostridium perfrin- 
gens, the cause of gangrene, is an example of an obligate 
anaerobe. Other pathogenic obligate anaerobes are listed 
in Table 16.2. 

Facultative anaerobes can function in the presence or 
absence of oxygen. For instance, organisms that live in 
the intertidal zone, such as the bivalve Mytilus (Figure 16.2), can function aer- 
obically, using gills when they are under water and anaerobically when exposed 
to the air. Such organisms display habitat-dependent anaerobic functioning, or 
habitat-dependent anaerobiosis. Muscles in most animais display activity-de- 
pendent anaerobiosis, meaning that they can function anaerobically for short pe- 
riods. For example, when animais perform bursts of intense exercise, their ATP 
needs rise faster than the ability of the body to provide oxygen to the muscle. 


TABLE 16.1 Starting and ending points 
of various fermentations 


Glucose 

Lactate 

Glucose 

Ethanol 

Arginine 

Pyrimidines 

Purines 

Ethylene glycol 
Threonine 
Leucine 
Phenylalanine 


lactate 

acetate 

ethanol 

acetate 

carbon dioxide 

carbon dioxide 

formate 

acetate 

propionate 

2-alkylacetate 

propionate 


Note: The products of some fermentations are the substrates 
for others. 


r 


TABLE 16.2 Examples of pathogenic obligate anaerobes 


Bacterium 

Results of infection 

Clostridium tetani 

Tetanus (lockjaw) 

Clostridium botulinum 

Botulism (an especially severe type of food 
poisoning) 

Clostridium perfringens 

Gas gangrene (gas is produced as an 
end point of the fermentation, distorting and 
destroying the tissue) 

Bartonella hensela 

Gat scratch fever (flulike symptoms) 

Bacteroides fragilis 

Abdominal, pelvic, pulmonary, and blood 
infections 



FIGURE 16.2 The bivalve Myfi/i/s. 

These mussels, inhabitants of the intertidal 
zone, display habitat-dependent anaero¬ 
biosis. [Ed Reschke/Peter Arnold.] 
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DHAP 



Stage 1 of glycolysis. The three steps 
of stage 1 begin with the phosphorylation 
of glucose by hexokinase. 


The muscle functions anaerobically until the lactic acid builds up to the point 
at which the fali in pH inhibits the anaerobic pathway (Section 16.2.1). 


16.1 GLYCOLYSIS IS AN ENERGY-CONVERSION PATHWAY 
IN MANY ORGANISMS 

We now start our consideration of the glycolytic pathway. This pathway is 
common to virtually all cells, both prokaryotic and eukaryotic. In eukaryotic 
cells, glycolysis takes place in the cytosol. This pathway can be thought of as 
comprising three stages (Figure 16.3). Stage 1, which is the conversion of glu¬ 
cose into fructose 1,6-bisphosphate, consists of three steps: a phosphorylation, 
an isomerization, and a second phosphorylation reaction. The strategy of these 
initial steps in glycolysis is to trap the glucose in the cell and form a compound 
that can he readily cleaved into phosphorylated three-carbon units. Stage 2 is the 
cleavage of the fructose 1,6-bisphosphate into two three-carbon fragments. 
These resulting three-carbon units are readily interconvertible. In stage 3, 
ATP is harvested when the three-carbon fragments are oxidized to pyruvate. 


16.1.1 Hexokinase Traps Glucose in the Cell and 
Begins Glycolysis 

Glucose enters cells through specific transport proteins (Section 16.2.4) and 
has one principal fate: it is phosphorylated by ATP to form glucose 6-phosphate. 
This step is notable for two reasons: (1) glucose 6-phosphate cannot diffuse 
through the membrane, because of its negative charges, and (2) the addition 
of the phosphoryl group begins to destabilize glucose, thus facilitating its fur- 
ther metabolism. The transfer of the phosphoryl group from ATP to the hy- 
droxyl group on carbon 6 of glucose is catalyzed by hexokinase. 




Glucose e-phosphate 
(C-6P) 


Phosphoryl transfer is a fundamental reaction in biochemistry and is one 
that was discussed in mechanistic and structural detail earlier (Section 
9.4.1). Kinases are enzymes that catalyze the transfer of a phosphoryl group 
from ATP to an acceptor. Hexokinase, then, catalyzes the transfer of a phos¬ 
phoryl group from ATP to a variety of six-carbon sugars (hexoses), such as 
glucose and mannose. Hexokinase, like adenylate kinase (Section 9.4.2) and 
all other kinases, requires Mg^^ (or another divalent metal ion such as Mn^^) 
for activity. The divalent metal ion forms a complex with ATP. 

The results of x-ray crystallographic studies of yeast hexokinase revealed 
that the binding of glucose induces a large conformational change in the en- 
zyme, analogous to the conformational changes undergone by NMP kinases 
on substrate binding (Section 9.4.3). Hexokinase consists of two lobes, 
which move toward each other when glucose is bound (Figure 16.4). On 
glucose binding, one lobe rotates 12 degrees with respect to the other, re¬ 
sulting in movements of the polypeptide backbone of as much as 8 À. The 
cleft between the lobes doses, and the bound glucose becomes surrounded 
by protein, except for the hydroxyl group of carbon 6, which will accept the 
phosphoryl group from ATP. The closing of the cleft in hexokinase is a strik- 
ing example of the role of induced fit in enzyme action (Section 8.2.2). 










16-5 


Glycolysis 



FIGURE 16.3 Stages of glycolysis. The 

glycolytic pathway can be divided into 
three stages: (1) glucose is trapped and 
destabilized; (2) two interconvertible 
three-carbon molecules are generated by 
cleavage of six-carbon fructose; and 
(3) ATP is generated. 



+ Glucose 


Glucose 


FIGURE 16.4 Induced fít ín hex- 
okinase. As shown in blue, the two 
lobes of hexokinase are separated in the 
absence of glucose. The conformation of 
hexokinase changes markedly on binding 
glucose, as shown in red. The two lobes 
of the enzyme come together and sur- 
round the substrate. 
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The glucose-induced structural changes are significant in two respects. 
First, the environment around the glucose becomes much more nonpolar, 
which favors the donation of the terminal phosphoryl group of ATP. Sec- 
ond, as noted in Section 9.4.3, the substrate-induced conformational changes 
within the kinase enables it to discriminate against H 2 O as a substrate. If 
hexokinase were rigid, a molecule of H 2 O occupying the binding site for the 
-CH 2 OH of glucose would attack the 7 phosphoryl group of ATP, form- 
ing ADP and Pi. In other words, a rigid kinase would necessarily also be an 
ATPase. It is interesting to note that other kinases taking part in glycoly¬ 
sis—pyruvate kinase, phosphoglycerate kinase, and phosphofructokinase— 
also contain clefts between lobes that dose when substrate is bound, al- 
though the structures of these enzymes are different in other regards. 
Substrate-induced cleft closing is a general feature of kinases. 


16.1.2 The Formation of Fructose 1,6-bisphosphate 
from Glucose 6-phosphate 

The next step in glycolysis is the isomerization of glucose 6-phosphate to fruc¬ 
tose 6-phosphate. Recall that the open-chain form of glucose has an alde- 
hyde group at carbon 1 , whereas the open-chain form of fructose has a keto 
group at carbon 2 . Thus, the isomerization of glucose 6 -phosphate to fruc¬ 
tose 6 -phosphate is a conversion of an aldose into a ketose. The reaction cat- 
alyzed by phosphoglucose isomerase includes additional steps because both 
glucose 6 -phosphate and fructose 6 -phosphate are present primarily in the 
cyclic forms. The enzyme must first open the six-membered ring of glucose 
6 -phosphate, catalyze the isomerization, and then promote the formation of 
the five-membered ring of fructose 6 -phosphate. 


CH 20 P 032 - 

H C OH 

0 . .CH 2 OH 

K OH A 

HO-C—H 

HO-C—H 

Hr\ /oh 

H-C—OH 

H-C—OH 

HÓH 

H-C—OH 

CH 2 OPO 32 - 

H-C—OH 

CH 2 OPO 32 - 

Glucose 6-phosphate 

Glucose 6-phosphate 

Fructose 6-phosphate 

(G-6P) 

(Open Chain form) 

(Open Chain form) 


2-O3POH2C CH2OH 



HO H 


Fructose 6-phosphate 
(F-6P) 


A second phosphorylation reaction follows the isomerization step. Fruc¬ 
tose 6-phosphate is phosphorylated by ATP to fructose 1,6-bisphosphate 
(F- 1 , 6 -BP). The prefix bis- in bisphosphate means that two separate mono- 
phosphate groups are present, whereas the prefix di- in diphosphate (as in 
adenosine diphosphate) means that two phosphate groups are present and 
are connected by an anhydride bond. 


2-03.0H2( 

OH 

Fructose 6-phosphate 
(F-6P) 



+ ATP 


Phosphofructokinase 


2-O3Í OH2C CH2OPO32- 



+ ADP + H+ 


Fructose 1,6-bisphosphate 
(F-1,6-BP) 


This reaction is catalyzed by phosphofructokinase (PFK), an allosteric en¬ 
zyme that sets the pace of glycolysis (Section 16.2.1). As we will learn, this 
enzyme plays a central role in the integration of much of metabolism. 











Glucose 


16.1.3 The Six-Carbon Sugar Is Cleaved 
into Two Three-Carbon Fragments by Aldolase 

The second stage of glycolysis begins with the splitting of fructose 1,6-bis- 
phosphate into glyceraldehyde 3-phosphate (GAP) and dihydroxyacetone 
phosphate (DHAP). The products of the remaining steps in glycolysis con- 
sist of three-carbon units rather than six-carbon units. 


0^^/.CH20P032- 

HO-C—H 

H—C— OH 
H—C—OH 

CH2OPO32- 

Fructose 

1,6-bisphosphate 

(F-1,6-BP) 


Aldolase 


o. ^CHjOPOj^- 

HO-C—H 


H 

+ 



H—C—OH 
CH20P032" 


Dihydroxyacetone 

phosphate 

(DHAP) 


Glyceraldehyde 

3-phosphate 

(GAP) 


This reaction is catalyzed by aldolase. This enzyme derives its name from 
the nature of the reverse reaction, an aldol condensation. The reaction cat¬ 
alyzed by aldolase is readily reversible under intracellular conditions. 


ATP 



1 ^ STAGE 2 F-1,6-BP A 


4 ■ 



PEP 



V Pyruvate 


Stage 2 of glycolysis. Two three-carbon 
fragments are produced from one six- 
carbon sugar. 


16.1.4 Triose phosphate isomerase Salvages 
a Three-Carbon Fragment 


Glyceraldehyde 3-phosphate is on the direct pathway of gly¬ 
colysis, whereas dihydroxyacetone phosphate is not. Unless 
a means exists to convert dihydroxyacetone phosphate into 
glyceraldehyde 3-phosphate, a three-carbon fragment useful 
for generating ATP will be lost. These compounds are isomers 
that can be readily interconverted: dihydroxyacetone phos¬ 
phate is a ketose, whereas glyceraldehyde 3-phosphate is an 
aldose. The isomerization of these three-carbon phosphory- 
lated sugars is catalyzed by triose phosphate isomerase (TIM; 
Figure 16.5). This reaction is rapid and reversible. At equili- 
brium, 96% of the triose phosphate is dihydroxyacetone phos¬ 
phate. However, the reaction proceeds readily from dihydrox¬ 
yacetone phosphate to glyceraldehyde 3-phosphate because 
the subsequent reactions of glycolysis remove this product. 

H V 

— OH Triose phosphate V 

/ isomerase I 

0=C V ^ H C— OH 



^HjOPOjZ- 

Dihydroxyacetone 

phosphate 


CH2OPO32 

Glyceraldehyde 

3-phosphate 


Much is known about the catalytic mechanism of triose phosphate iso¬ 
merase. TIM catalyzes the transfer of a hydrogen atom from carbon 1 to 
carbon 2 in converting dihydroxyacetone phosphate into glyceraldehyde 
3-phosphate, an intramolecular oxidation-reduction. This isomerization of 
a ketose into an aldose proceeds through an enediol intermediate (Figure 16.6). 

X-ray crystallographic and other studies showed that glutamate 165 (see 
Figure 16.5) plays the role of a general acid-base catalyst. However, this 




FIGURE 16.5 Structure of triose 
phosphate isomerase. This enzyme 
consists of a central core of eight parallel 
jS strands (orange) surrounded by eight a 
helices (blue). This structural motif, called 
an ajS barrei, is aiso found in the glycolytic 
enzymes aldolase, enolase, and pyruvate 
kinase. Histidine 95 and glutamate 165, 
essential components of the active site of 
triose phosphate isomerase, are located in 
the barrei. A loop (red) doses off the ac¬ 
tive site on substrate binding. 













Dihydroxyacetone 

phosphate 


His 95 


Enediol 

intermediate 



Glyceraldehyde 

3-phosphate 





FIGURE 16.6 Catalytic mechanísm of 
tríose phosphate ísomerase. Glutamate 
165 transfers a proton between carbons 
with the assistance of histidine 95, which 
shuttles between the neutral and relatively 
rare negatively charged form. The latter is 
stabilized by interactions with other parts 
of the enzyme. 


carboxylate group by itself is not basic enough to pull a proton away from 
a carbon atom adjacent to a carbonyl group. Histidine 95 assists catalysis 
by donating a proton to stabilize the negative charge that develops on the 
C-2 carbonyl group. 

Two features of this enzyme are noteworthy. First, TIM displays great 
catalytic prowess. It accelerates isomerization by a factor of 10^^ compared 
with the rate obtained with a simple base catalyst such as acetate ion. Indeed, 
the fecat/í^M ratio for isomerization of glyceraldehyde 3-phosphate is 2 X 
10 ^ s~\ which is dose to the diffusion-controlled limit. In other words, 

the rate-limiting step in catalysis is the diffusion-controlled encounter of sub- 
strate and enzyme. TIM is an example of a kinetically perfect enzyme (Section 
8.2.5). Second, TIM suppresses an undesired side reaction, the decomposi- 
tion of the enediol intermediate into methyl glyoxal and inorganic phosphate. 



Enediol intermediate Methyl glyoxal 


In solution, this physiologically useless reaction is 100 times as fast as iso¬ 
merization. Hence, TIM must prevent the enediol from leaving the enzyme. 
This labile intermediate is trapped in the active site by the movement of a 
loop of 10 residues (see Figure 16.5). This loop serves as a lid on the active 
site, shutting it when the enediol is present and reopening it when isomer¬ 
ization is completed. We see here a striking example not only of catalytic per- 
fection, hut also of the acceleration of a desirable reaction so that it takes place 
muchfaster than an undesirable alternative reaction. Thus, two molecules of 
glyceraldehyde 3-phosphate are formed from one molecule of fructose 
1 ,6-bisphosphate by the sequential action of aldolase and triose phosphate 
Ísomerase. The economy of metabolism is evident in this reaction sequence. 
The Ísomerase funnels dihydroxyacetone phosphate into the main glycolytic 
pathway—a separate set of reactions is not needed. 











16.1.5 Energy Transformation: Phosphorylation Is Coupled 
to the Oxidation of Glyceraldehyde 3-phosphate 
by a Thioester Intermediate 

The preceding steps in glycolysis have transformed one molecule of glucose 
into two molecules of glyceraldehyde 3-phosphate, but no energy has yet 
been extracted. On the contrary, thus far two molecules of ATP have been 
invested. We come now to a series of steps that harvest some of the energy 
contained in glyceraldehyde 3-phosphate. The initial reaction in this se- 
quence is the conversion of glyceraldehyde 3-phosphate into f3-bisphospho- 
glycerate (1,3-BPG), a reaction catalyzed by glyceraldehyde 3-phosphate de- 
hydrogenase (Figure 16.7). 
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H C—OH -H NAD+ -H Pi V ^ H-C—OH -H NAD -H H+ 

CH2OPO32- CH2OPO32- 


Glyceraldehyde 

3-phosphate 

(GAP) 


1,3-Bisphosphoglycerate 
(1,3-BPG) 


1,3-Bisphosphoglycerate is an acyl phosphate. Such compounds have a high 
phosphoryl-transfer potential; one of its phosphoryl groups is transferred to 
ADP in the next step in glycolysis. The reaction catalyzed by glyceralde¬ 
hyde 3-phosphate dehydrogenase is really the sum of two processes: the oxi¬ 
dation of the aldehyde to a carboxylic acid by NAD^ and the joining of the 
carboxylic acid and orthophosphate to form the acyl-phosphate product. 
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Stage 3 of glycolysis. The oxidation of 
three-carbon fragments yields ATP. 
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FIGURE 16.7 Structure of glyc¬ 
eraldehyde 3-phosphate dehydroge¬ 
nase. The active site includes a cysteine 
residue and a histidine residue adjacent to 
a bound NAD^. 
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FIGURE 16.8 Catalytic mechanísm of 
glyceraldehyde 3-phosphate dehydro- 
genase. The reaction proceeds through a 
thioester intermediate, which allows the 
oxidation of glyceraldehyde to be coupled 
to the phosphorylation of 3-phosphoglyc- 
erate. 


The first reaction is quite thermodynamically favorable 
with a standard free-energy change, AG°', of approximately 
— 12 kcal moP^ ( —50 kj moP^), whereas the second reac¬ 
tion is quite unfavorable with a standard free-energy change 
of the same magnitude but the opposite sign. If these two 
reactions simply took place in succession, the second reac¬ 
tion would have a very large activation energy and thus not 
take place at a biologically significant rate. These two 
processes must be coupled so that the favorable aldehyde ox¬ 
idation can be used to drive the formation of the acyl phos- 
phate. How are these reactions coupled? The key is an in¬ 
termediate, formed as a result of the aldehyde oxidation, that 
is higher in free energy than the free carboxylic acid is. This intermediate 
reacts with orthophosphate to form the acyl-phosphate product. 

Let us consider the mechanism of glyceraldehyde 3-phosphate dehy- 
drogenase in detail (Figure 16.8). In step 1, the aldehyde substrate reacts with 
the sulfhydryl group of cysteine 149 on the enzyme to form a hemithioac¬ 
etal. Step 2 is the transfer of a hydride ion to a molecule of NAD^ that is tightly 
bound to the enzyme and is adjacent to the cysteine residue. This reaction is 
favored by the deprotonation of the hemithioacetal by histidine 176. The 
Products of this reaction are the reduced coenzyme NADH and a thioester 
intermediate. This thioester intermediate has a free energy dose to that of the 
reactants. In step 3, orthophosphate attacks the thioester to form 1,3-BPG 
and free the cysteine residue. This displacement occurs only after the 
NADH formed from the aldehyde oxidation has left the enzyme and been 
replaced by a second NAD^. The positive charge on the NAD^ may help 
polarize the thioester intermediate to facilitate the attack by orthophosphate. 

This example illustrates the essence of energy transformations and of 
metabolism itself: energy released by carbon oxidation is converted into high 
phosphoryl-transfer potential. The favorable oxidation and unfavorable 




















phosphorylation reactions are coupled by the thioester intermediate, which 
preserves much of the free energy released in the oxidation reaction. We see 
here the use of a covalent enzyme-bound intermediate as a mechanism of en¬ 
ergy coupling. A free-energy profile of the glyceraldehyde 3-phosphate de- 
hydrogenase reaction, compared with a hypothetical process in which the 
reaction proceeds without this intermediate, reveals how this intermediate 
allows a favorable process to drive an unfavorable one (Figure 16.9). 




16.1.6 The Formation of ATP from 1,3-Bisphosphoglycerate 

The final stage in glycolysis is the generation of ATP from the phosphory- 
lated three-carbon metabolites of glucose. Phosphoglycerate kinase catalyzes 
the transfer of the phosphoryl group from the acyl phosphate of 1,3-bis- 
phosphoglycerate to ADR ATP and 3-phosphoglycerate are the products. 

.OPOp- O. - O 

'"c 

I Phosphoglycerate | 

H C OH + aDP H Ç—OH + aTP 

CH2OPO32- CH2OPO32- 

1,3-Bisphosphoglycerate 3-Phosphoglycerate 


The formation of ATP in this manner is referred to as substrate-levei phospho¬ 
rylation because the phosphate donor, 1,3-BPG, is a substrate with high phos¬ 
phoryl-transfer potential. We will contrast this manner of ATP formation 
with that in which ATP is formed from ionic gradients in Chapters 19 and 20. 

Thus, the outcomes of the reactions catalyzed by glyceraldehyde 
3-phosphate dehydrogenase and phosphoglycerate kinase are: 

1. Glyceraldehyde 3-phosphate, an aldehyde, is oxidized to 3-phospho¬ 
glycerate, a carboxylic acid. 

2. NAD^ is concomitantly reduced to NADH. 

3. ATP is formed from Pi and ADP at the expense of carbon oxidation 
energy. 

Keep in mind that, because of the actions of aldolase and triose phosphate 
isomerase, two molecules of glyceraldehyde 3-phosphate were formed and 
hence two molecules of ATP were generated. These ATP molecules make 
up for the two molecules of ATP consumed in the first stage of glycolysis. 

16.1.7 The Generation of Additional ATP and 
the Formation of Pyruvate 

In the remaining steps of glycolysis, 3-phosphoglycerate is converted into 
pyruvate with the concomitant conversion of ADP into ATP. 
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FIGURE 16.9 Free-energy profiles for 
glyceraldehyde oxidation followed by 
acyl-phosphate formation. (A) A hypo¬ 
thetical case with no coupling between 
the two processes. The second step must 
have a large activation barrier, making the 
reaction very slow. (B) The actual case 
with the two reactions coupled through a 
thioester intermediate. 
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The first reaction is a rearrangement. The position of the phosphoryl group 
shifts in the conversion of 3-phosphoglycerate into 2-phosphoglycerate, a re¬ 
action catalyzed hy phosphoglycerate mutase. In general, a mutase is an enzyme 
that catalyzes the intramolecular shift of a chemical group, such as a phos¬ 
phoryl group. The phosphoglycerate mutase reaction has an interesting 
mechanism: the phosphoryl group is not simply moved from one carbon to 
another. This enzyme requires catalytic amounts of 2,3-bisphosphoglycer- 
ate to maintain an active-site histidine residue in a phosphorylated form. 

Enz-His + 2,3 bisphosphoglycerate ^ 

Enz-His-phosphate + 2-phosphoglycerate 

Enz-His-phosphate + 3-phosphoglycerate ^ 

Enz-His + 2,3-bisphosphoglycerate 

The sum of these reactions yields the mutase reaction: 

3-Phosphoglycerate ^ 2-phosphoglycerate 

Examination of the first partial reaction reveals that the mutase functions 
as a phosphatase—it converts 2,3-bisphosphoglycerate into 2-phospho¬ 
glycerate. However, the phosphoryl group remains linked to the enzyme. 
This phosphoryl group is then transferred to 3-phosphoglycerate to reform 
2,3 - bisphosphoglycerate. 

In the next reaction, an enol is formed by the dehydration of 2-phos¬ 
phoglycerate. Enolase catalyzes the formation oíphosphoenolpyruvate (PEP). 
This dehydration markedly elevates the transfer potential of the phospho¬ 
ryl group. An enol phosphate has a high phosphoryl-transfer potential. 


TABLE 16.3 Reactions of glycolysis 


Step Reaction 

Enzyme 


9 

10 


Glucose + ATP —> glucose 6-phosphate + ADP + H"^ 

Glucose 6-phosphate ^ fructose 6-phosphate 
Fructose 6-phosphate + ATP —> 

fructose 1,6-bisphosphate + ADP + H"^ 
Fructose 1,6-bisphosphate ^ 

dihydroxyacetonephosphate + glyceraldehyde 3-phosphate 
Dihydroxyacetone phosphate ^ glyceraldehyde 3-phosphate 
Glyceraldehyde 3-phosphate +Pi + NAD"^ ^ 


1,3-bisphosphoglycerate + NADH + H"^ 
1,3-Bisphosphoglycerate + ADP ^ 3-phosphoglycerate + ATP 

3-Phosphoglycerate ^ 2-phosphoglycerate 

2-Phosphoglycerate ^ phosphoenolpyruvate +H 2 O 

Phosphoenolpyruvate + ADP + H"^—> pyruvate + ATP 


Hexokinase 

Phosphoglucose isomerase 
Phosphofructokinase 

Aldolase 

Triose phosphate isomerase 
Glyceraldehyde 3-phosphate 
dehydrogenase 
Phosphoglycerate kinase 
Phosphoglycerate mutase 
Enolase 

Pyruvate kinase 


Note: AG, the actual free-energy change, has been calculated from AG°' and known con- 
centrations of reactants under typicai physiologic conditions. Glycolysis can proceed only 
if the AG values of all reactions are negative. The small positive AG values of three of the 



























whereas the phosphate ester, such as 2-phosphoglycerate, of an ordinary al- 
cohol has a low one. The AG°' of the hydrolysis of a phosphate ester of an 
ordinary alcohol is —3 kcal moP^ ( — 13 kj moP^), whereas that of phos- 
phoenolpyruvate is —14.8 kcal moP^ (— 62 kJ moP^). Why does phos- 
phoenolpyruvate have such a high phosphoryl-transfer potential? The phos- 
phoryl group traps the molecule in its unstable enol form. When the 
phosphoryl group has been donated to ATP, the enol undergoes a conver- 
sion into the more stable ketone—namely, pyruvate. 



Phosphenolpyruvate Pyruvate Pyruvate 

(enol form) 


Thus, the high phosphoryl-transfer potential of phosphoenolpyruvate arises 
primarily from the large driving force of the subsequent enol—ketone conver- 
sion. Hence, pyruvate is formed, and ATP is generated concomitantly. The 
virtually irreversible transfer of a phosphoryl group from phosphoenolpyru¬ 
vate to ADP is catalyzed by pyruvate kinase. Because the molecules of ATP 
used in forming fructose 1,6-bisphosphate have already been regenerated, 
the two molecules of ATP generated from phosphoenolpyruvate are ‘‘profit.’' 

16.1.8 Energy Yield in the Conversion of Glucose into Pyruvate 

The net reaction in the transformation of glucose into pyruvate is: 

Glucose + 2 P^ + 2 ADP + 2 NAD^-> 

2 pyruvate + 2 ATP + 2 NADH + 2 + 2 H 2 O 

Thus, two molecules of ATP are generated in the conversion of glucose into 
two molecules of pyruvate. The reactions of glycolysis are summarized in 
Table 16.3. 



AG”' in 

AG in 


kcal moP^ 

kcal mol”^ 

Reaction type 

(kJ moP^) 

(kJ moP^) 

Phosphoryl transfer 

-4.0 (-16.7) 

-8.0 (-33.5) 

Isomerization 

+0.4 ( + 1.7) 

-0.6 (-2.5) 

Phosphoryl transfer 

-3.4 (-14.2) 

-5.3 (-22.2) 

Aldol cleavage 

+ 5.7 ( + 23.8) 

-0.3 (-1.3) 

Isomerization 

+ 1.8 ( + 7.5) 

+0.6 (+2.5) 

Phosphorylation coupled 
to oxidation 

+ 1.5 ( + 6.3) 

+0.6 (+2.5) 

Phosphoryl transfer 

-4.5 (-18.8) 

+0.3 ( + 1.3) 

Phosphoryl shift 

+ 1.1 (+4.6) 

+0.2 (+0.8) 

Dehydration 

+ 0.4 ( + 1.7) 

-0.8 (-3.3) 

Phosphoryl transfer 

-7.5 (-31.4) 

-4.0 (-16.7) 


above reactions indicate that the concentrations of metabolites in vivo in cells undergoing 
glycolysis are not precisely known. 
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Note that the energy released in the anaerobic conversion of glucose into 
two molecules of pyruvate is —47 kcal moP^ (— 197 kj moP^). We shall see 
in Chapters 18 and 19 how much more energy can be released from glucose 
in the presence of oxygen. 

CONCEPTUAL INSIGHTS, Energetics of Glucose Metabolism. See the section 
on the energetics of glycolysis in the Conceptual Insights module for a graphical repre- 
sentation of free energy differences among glycolytic metabolites, and how these differ- 
ences are used to drive ATP and NADH synthesis in coupled reactions. 
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16.1.9 Maintaining Redox Balance: The Diverse 
Fates of Pyruvate 

The conversion of glucose into two molecules of pyruvate has resulted in 
the net synthesis of ATP. However, an energy-converting pathway that 
stopped at pyruvate would not proceed for long, because redox balance 
has not been maintained. As we have seen, the activity of glyceraldehyde 
3-phosphate dehydrogenase, in addition to generating a compound with high 
phosphoryl-transfer potential, of necessity leads to the reduction of NAD^ 
to NADH. There are limited amounts of NAD^ in the cell, which is derived 
from the vitamin niacin, a dietary requirement in human beings. Conse- 
quently, NAD^ must be regenerated for glycolysis to proceed. Thus, the fi¬ 
nal process in the pathway is the regeneration of NAD^ through the me¬ 
tabolism of pyruvate. The sequence of reactions from glucose to pyruvate is 
similar in most organisms and most types of cells. In contrast, the fate of 
pyruvate is variable. Three reactions of pyruvate are of prime importance: 
conversion into ethanol, lactic acid, or carbon dioxide (Figure 16.10). 
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FIGURE 16.10 Diverse fates of pyruvate. Ethanol and lactate can be formed by 
reactions involving NADH. Alternatively, a two-carbon unit from pyruvate can be coupled 
to coenzyme A (see Section 18.x.x) to form acetyl CoA. 


1. Ethanol is formed from pyruvate in yeast and several other microorganisms. 
The first step is the decarboxylation of pyruvate. This reaction is catalyzed 
by pyruvate decarboxylase, which requires the coenzyme thiamine py- 
rophosphate. This coenzyme, derived from the vitamin thiamine (Bi), also 
participates in reactions catalyzed by other enzymes (Section 17.x.x). The 
second step is the reduction of acetaldehyde to ethanol by NADH, in a re¬ 
action catalyzed by alcohol dehydrogenase. This process regenerates NAD^. 
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The active site of alcohol dehydrogenase contains a zinc ion that is coordi- 
nated to the sulfur atoms of two cysteine residues and a nitrogen atom of 
histidine (Figure 16.11). This zinc ion polarizes the carbonyl group of the 
substrate to favor the transfer of a hydride from NADH. 

The conversion of glucose into ethanol is an example of alcoholic fer- 
mentation. The net result of this anaerobic process is: 

Glucose + 2 Pi + 2 ADP + 2 -> 

2 ethanol + 2 CO 2 + 2 ATP + 2 H 2 O 

Note that NAD^ and NADH do not appear in this equation, even though 
they are crucial for the overall process. NADH generated by the oxidation 
of glyceraldehyde 3-phosphate is consumed in the reduction of acetalde- 
hyde to ethanol. Thus, there is no net oxidation-reduction in the conversion 
of glucose into ethanol (Figure 16.12). The ethanol formed in alcoholic fer- 
mentation provides a key ingredient for brewing and winemaking. 

2. Lactate is formed from pyruvate in a variety of microorganisms in a 
process called lactic acid fermentation. The reaction also takes place in the 
cells of higher organisms when the amount of oxygen is limiting, as in mus- 
cle during intense activity. The reduction of pyruvate by NADH to form 
lactate is catalyzed by lactate dehydrogenase. 

HO-C- H 

CH 3 

Lactate 

The overall reaction in the conversion of glucose into lactate is: 

Glucose + 2 Pi + 2 ADP-> 2 lactate + 2 ATP + 2 H 2 O 

As in alcoholic fermentation, there is no net oxidation-reduction. The 
NADH formed in the oxidation of glyceraldehyde 3-phosphate is consumed 
in the reduction of pyruvate. The regeneration of NAD^in the reduction of 
pyruvate to lactate or ethanol sustains the continued operation of glycolysis un- 
der anaerobic conditions. 

3. Only a fraction of the energy of glucose is released in its anaerobic con¬ 
version into ethanol or lactate. Much more energy can be extracted aerobic- 
ally by means of the citric acid cycle and the electron-transport chain. The 
entry point to this oxidative pathway is acetyl coenzyme A (acetyl CoA), 
which is formed inside mitochondria by the oxidative decarboxylation of 
pyruvate. 

Pyruvate + NAD^ + CoA-> acetyl CoA + CO 2 + NADH 


NADH 



Pyruvate 



FIGURE 16.11 Active site of alcohol 
dehydrogenase. The active site contains 
a zinc ion bound to two cysteine residues 
and one histidine residue. The zinc ion 
binds the acetaldehyde substrate through 
its oxygen atom, polarizing it so that it 
more easily accepts a hydride (light blue) 
from NADH. Only the nicotinamide ring of 
NADH is shown. 


FIGURE 16.12 Maíntaíníng redox 
balance. The NADH produced by the 
glyceraldehyde 3-phosphate dehydroge¬ 
nase reaction must be reoxidized to NAD^ 
for the glycolytic pathway to continue. 

In alcoholic fermentation, alcohol dehydro¬ 
genase oxidizes NADH and generates 
ethanol. In lactic acid fermentation (not 
shown), lactate dehydrogenase oxidizes 
NADH while generating lactic acid. 
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FIGURE 16.13 NAD^-bindíng 
regíon ín dehydrogenases. The nico- 
tinamide-binding half (yellow) is struc- 
turally similar to the adenine-binding half 
(red). The two halves together form a 
structural motif called a Rossmann fold. 
The NAD^ molecule binds in an extended 
conformation. 


This reaction, which is catalyzed by the pyruvate dehydrogenase complex, 
will be discussed in detail in Chapter 18. The NAD^ required for this re¬ 
action and for the oxidation of glyceraldehyde 3-phosphate is regenerated 
when NADH ultimately transfers its electrons to O 2 through the electron- 
transport chain in mitochondria. 


16.1.10 The Binding Site for NAD^ Is Similar in 
Many Dehydrogenases 


55^[l Although the enzymes taking part in glycolysis and the subsequent 


conversion of pyruvate are structurally diverse, the three dehydro¬ 
genases—glyceraldehyde 3-phosphate dehydrogenase, alcohol dehydroge¬ 
nase, and lactate dehydrogenase—have in common a domain for NAD^ bind¬ 
ing (Figure 16.13). This nucleotide-binding region is made up of four a 
helices and a sheet of six parallel (3 strands. Moreover, in all cases, the bound 
NAD^ displays nearly the same conformation. This common structural do¬ 
main, one of the first recurring structural domains to be discovered, is of- 
ten called a Rossmann fold after Michael Rossmann, who first recognized it. 
This fold likely represents a primordial dinucleotide-binding domain that 
recurs in the dehydrogenases of glycolysis and other enzymes because of 
their descent from a common ancestor. 


Nicotinamide-binding half 



16.1.11 The Entry of Fructose and Galactose into Glycolysis 

Although glucose is the most widely used monosaccharide, others also are 
important fuels. Let us consider how two abundant sugars—fructose and 
galactose—can be funneled into the glycolytic pathway (Figure 16.14). 
Much of the ingested fructose is metabolized by the liver, using the fruct¬ 
ose 1-phosphate pathway (Figure 16.15). The first step is the phosphoryla- 
tion of fructose to fructose 1 -phosphate hyfructokinase. Fructose 1 -phosphate 
is then split into glyceraldehyde and dihydroxyacetone phosphate, an inter- 
mediate in glycolysis. This aldol cleavage is catalyzed by a specific /ructose 
1 -phosphate aldolase. Glyceraldehyde is then phosphorylated to glyceralde¬ 
hyde 3-phosphate, a glycolytic intermediate, by triose kinase. Alternatively, 
fructose can be phosphorylated to fructose 6-phosphate by hexokinase. How- 
ever, the affinity of hexokinase for glucose is 20 times as great as it is for 
fructose. Little fructose 6-phosphate is formed in the liver because glucose 



is so much more abundant in this organ. Moreover, glu¬ 
cose, as the preferred fuel, is also trapped in the muscle 
by the hexokinase reaction. Because liver and muscle 
phosphorylate glucose rather than fructose, adipose tis- 
sue is exposed to more fructose than glucose. Hence, the 
formation of fructose 6-phosphate is not competitively 
inhibited to a biologically significant extent, and most of 
the fructose in adipose tissue is metabolized through 
fructose 6-phosphate. 

There are no catabolic pathways to metabolizega/act- 
ose, so the strategy is to convert galactose into a metabo- 
lite of glucose. Galactose is converted into glucose 6-phos¬ 
phate in four steps. The first reaction in the galactose— 
glucose interconversion pathway is the phosphorylation of 
galactose to galactose 1 -phosphate by galactokinase. 
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FIGURE 16.14 Entry points in 
glycolysís for galactose and fructose. 


Galactose 1-phosphate then acquires a uridyl group from uridine diphos- 
phate glucose (UDP-glucose), an intermediate in the synthesis of glycosidic 
linkages (Section 21.x.x). 
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FIGURE 16.15 Fructose metabolism. 

Fructose enters the glycolytic pathway in 
the liver through the fructose 1-phosphate 
pathway. 
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The Products of this reaction, which is catalyzed by galactose 1 -phosphate 
uridyl transferase, are UDP-galactose and glucose 1 -phosphate. The galactose 
moiety of UDP-galactose is then epimerized to glucose. The configuration 
of the hydroxyl group at carbon 4 is inverted by UDP-galactose 4-epimerase. 

The sum of the reactions catalyzed by galactokinase, the transferase, and 
the epimerase is: 

Galactose + ATP-> glucose 1 -phosphate + ADP + 

Note that UDP-glucose is not consumed in the conversion of galactose into 
glucose, because it is regenerated from UDP-galactose by the epimerase. 
This reaction is reversible, and the product of the reverse direction also is 
important. The conversion of UDP-glucose into UDP-galactose is essential 
for the synthesis of galactosyl residues in complex polysaccharides and glyco- 
proteins if the amount of galactose in the diet is inadequate to meet these needs. 

Finally, glucose 1 -phosphate, formed from galactose, is isomerized to 
glucose 6 -phosphate hy phosphoglucomutase. We shall return to this reaction 
when we consider the synthesis and degradation of glycogen, which pro- 
ceeds through glucose 1 -phosphate, in Chapter 21. 

16.1.12 Many Aduits Are Intolerant of Milk Because They Are 
Deficient in Lactase 

7 Many aduits are unable to metabolize the milk sugar lactose and ex- 
perience gastrointestinal disturbances if they drink milk. Lactose in- 
tolerance, or hypolactasia, is most commonly caused by a deficiency of the 
enzyme lactase, which cleaves lactose into glucose and galactose. 



+ H2O 


Lactase 




Scanning electron micrograph of 
Lactobadilus. The anaerobic bactéria Lac- 
tobacillus is shown here (artificially col- 
ored) at a magnification of 22, 245 x. As 
suggested by its name, this genus of bacté¬ 
ria ferments glucose into lactic acid, and is 
widely used in the food industry. Lacto- 
bacillus is also a component of the normal 
human bacterial flora of the urogenital 
tract where, because of its ability to gener- 
ate an acidic environment, it prevents 
growth of harmfui bactéria. [Dr. Dennis 
Kunkel/PhotoTake.] 


‘‘Deficiency'' is not quite the appropriate term, because a decrease in lac¬ 
tase is normal during development in all mammals. As children are weaned 
and milk becomes less prominent in their diets, lactase activity normally de¬ 
clines to about 5 to 10 % of the levei at birth. This decrease is not as pro- 
nounced with some groups of people, most notably Northern Europeans, 
and people from these groups can continue to ingest milk without gas¬ 
trointestinal difficulties. With the appearance of milk-producing domesti- 
cated animais, a human being with a genetic alteration endowing high 
leveis of lactase activity in adulthood would hypothetically have a selective 
advantage in being able to consume calories from the readily available milk. 

What happens to the lactose in the intestine of a lactase-deficient per- 
son? The lactose is a good energy source for microorganisms in the colon, 
and they ferment it to lactic acid while also generating methane (CH 4 ) and 
hydrogen gas (H 2 ). The gas produced creates the uncomfortable feeling of 
gut distention and the annoying problem of flatulence. The lactic acid pro¬ 
duced by the microorganisms is osmotically active and draws water into the 
intestine, as does any undigested lactose, resulting in diarrhea. If severe 
enough, the gas and diarrhea hinder the absorption of other nutrients such 
as fats and proteins. The simplest treatment is to avoid the consumption of 
Products containing much lactose. Alternatively, the enzyme lactase can be 
ingested with milk products. 







16.1.13 Galactose Is Highly Toxic If the Transferase Is Missing 

7 Less common than lactose intolerance are disorders that interfere 
with the metabolism of galactose. The disruption of galactose meta- 
bolism is referred to as galactosemia. The most common form, called classic 
galactosemia, is an inherited deficiency in galactose 1 -phosphate uridyl trans¬ 
ferase activity. Afflicted infants fail to thrive. They vomit or have diarrhea 
after consuming milk, and enlargement of the liver and jaundice are com¬ 
mon, sometimes progressing to cirrhosis. Cataracts will form, and lethargy 
and retarded mental development also are common. The blood-galactose 
levei is markedly elevated, and galactose is found in the urine. The absence 
of the transferase in red blood cells is a definitive diagnostic criterion. 

The most common treatment is to remove galactose (and lactose) from 
the diet. The enigma of galactosemia is that, although elimination of galac¬ 
tose from the diet prevents liver disease and cataract development, the ma- 
jority of patients still suffer from central nervous system malfunction, most 
commonly a delayed acquisition of language skills. Females will also dis- 
play ovarian failure. 

Cataract formation is better understood. A cataract is the clouding of the 
normally clear lens of the eye. If the transferase is not active in the lens of 
the eye, the presence of aldose reductase causes the accumulating galactose 
to be reduced to galactitol. 
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Galactitol is osmotically active, and water will diffuse into the lens, in- 
stigating the formation of cataracts. In fact, there is a high incidence of 
cataract formation with age in populations that consume substantial 
amounts of milk into adulthood. 


16.2 THE GLYCOLYTIC PATHWAY IS 
TIGHTLY CONTROLLED 

The flux through the glycolytic pathway must be adjusted in response to 
conditions both inside and outside the cell. The rate of conversion of glu¬ 
cose into pyruvate is regulated to meet two major cellular needs: (1) the pro- 
duction of ATP, generated by the degradation of glucose, and (2) the pro- 
vision of building blocks for synthetic reactions, such as the formation of 
fatty acids. In metaholic pathways, enzymes catalyzing essentially irreversible 
reactions are potential sites of control In glycolysis, the reactions catalyzed 
by hexokinase, phosphofructokinase, and pyruvate kinase are virtually ir¬ 
reversible; hence, these enzymes would be expected to have regulatory as 
well as catalytic roles. In fact, each of them serves as a control site. Their 
activities are regulated by the reversible binding of allosteric effectors or by 
covalent modification. In addition, the amounts of these important enzymes 
are varied by the regulation of transcription to meet changing metabolic 
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Gluconeogenesis seconds, and hours, respectively. 

16.2.1 Phosphofructokinase Is the Key Enzyme 
in the Control of Glycolysis 

Phosphofructokinase is the most important control element in the mammalian 
glycolytic pathway (Figure 16.16). High leveis of ATP allosterically inhibit 
the enzyme in the liver (a 340-kd tetramer), thus lowering its affinity for 
fructose 6-phosphate. A high concentration of ATP converts the hyperbolic 
binding curve of fructose 6-phosphate into a sigmoidal one (Figure 16.17). 
ATP elicits this effect by binding to a specific regulatory site that is distinct 
from the catalytic site. AMP reverses the inhibitory action of ATP, and so 
the activity of the enzyme increases when the ATP/AMP ratio is lowered. In 
other words, glycolysis is stimulated as the energy charge falis. A fali in pH 
also inhibits phosphofructokinase activity. The inhibition of phosphofruc¬ 
tokinase by prevents excessive formation of lactic acid (Section 16.1.9) 
and a precipitous drop in blood pH (acidosis). 


FIGURE 16.16 Structure of 
phosphofructokinase. Phosphofructoki¬ 
nase in the liver is a tetramer of four iden- 
tical subunits. The positions of the catalytic 
and allosteric sites are indicated. 



FIGURE 16.17 Allosteric regulation of 
phosphofructokinase. A high levei of 
ATP inhibits the enzyme by decreasing 
its affinity for fructose 6-phosphate. AMP 
diminishes and citrate enhances the in¬ 
hibitory effect of ATP. 



Why is AMP and not ADP the positive regulator of phosphofructoki¬ 
nase? When ATP is being utilized rapidly, the enzyme adenylate kinase (Sec¬ 
tion 9.4) can form ATP from ADP by the following reaction: 

ADP + ADP ATP + AMP 

Thus, some ATP is salvaged from ADP, and AMP becomes the signal for 
the low-energy State. Moreover, the use of AMP as an allosteric regulator 
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FIGURE 16.18 Activatíon of 
phosphofructokínase by fructose 
2,6-bisphosphate. (A) The sigmoidal 
dependence of velocity on substrate 
concentration becomes hyperbolic in the 
presence of 1 ijlM fructose 2,6-bisphos- 
phate. (B) ATP, acting as a substrate, 
initially stimulates the reaction. As the 
concentration of ATP increases, it acts as 
an allosteric inhibitor. The inhibitory effect 
of ATP is reversed by fructose 2,6-bisphos- 
phate. [After H.-G. Hers and E. Van Schaftingen. 
Proc. Natl. Acad. Sei. 78(1981 ):2862.] 


provides an especially sensitive control. We can understand why by con- 
sidering, first, that the total adenylate pool ([ATP], [ADP], [AMP]) in a 
cell is constant o ver the short term and, second, that the concentration of 
ATP is greater than that of ADP and the concentration of ADP is, in turn, 
greater than that of AMP. Consequently, small-percentage changes in 
[ATP] result in larger-percentage changes in the concentrations of the other 
adenylate nucleotides. This magnification of small changes in [ATP] to 
larger changes in [AMP] leads to tighter control by increasing the range of 
sensitivity of phosphofructokínase. 

Glycolysis also furnishes carbon skeletons for biosyntheses, and so a sig- 
nal indicating whether building blocks are abundant or scarce should also 
regulate phosphofructokínase. Indeed, phosphofructokinase is inhibited by 
citrate, an early intermediate in the citric acid cycle (Section IS.x.x). A high 
levei of citrate means that biosynthetic precursors are abundant and addi- 
tional glucose should not be degraded for this purpose. Citrate inhibits 
phosphofructokinase by enhancing the inhibitory effect of ATP. 

In 1980, fructose 2,6-bisphosphate (F-2,6-BP) was identified as a potent 
activator of phosphofructokinase. Fructose 2,6-bisphosphate activates 
phosphofructokinase by increasing its affinity for fructose 6-phosphate and 
diminishing the inhibitory effect of ATP (Figure 16.18). In essence, Fruc¬ 
tose 2,6-bisphosphate is an allosteric activator that shifts the conformational 
equilibrium of this tetrameric enzyme from the T State to the R State. 

16.2.2 A Regulated Bifunctional Enzyme Synthesizes and 
Degrades Fructose 2,6 -bisphosphate 

How is the concentration of fructose 2,6-bisphosphate appropriately con- 
trolled? Two enzymes regulate the concentration of this important regula- 
tor of glycolysis by phosphorylating fructose 6-phosphate and dephosphor- 
ylating fructose 2,6-bisphosphate. Fructose 2,6-bisphosphate is formed in 
a reaction catalyzed by phosphofructokinase 2 (PFK2), a different enzyme 
from phosphofructokinase. Fructose 2,6-bisphosphate is hydrolyzed to 
fructose 6-phosphate by a specific phosphatase, fructose bisphosphatase 2 
(FBPase2). The striking finding is that both PFK2 and FBPase2 are present 
in a single 55-kd polypeptide chain (Figure 16.19). This bifunctional enzyme 
contains an N-terminal regulatory domain, followed by a kinase domain and 
a phosphatase domain. PFK2 resembles adenylate kinase in having a 
P-loop NTPase domain (Sections 9.4.1 and 9.4.3), whereas FBPase2 re¬ 
sembles phosphoglycerate mutase (Section 16.1.7). Recall that the mutase 
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FIGURE 16.19 Domain structure 
of the bifunctional enzyme phospho- 
fructokínase 2. The kinase domain (pur- 
ple) is fused to the phosphatase domain 
(red). The kinase domain is a P-loop NTP 
hydrolase domain, as indicated by the pur- 
ple shading (Section 9.4.4). The bar repre- 
sents the amino add sequence of the en¬ 
zyme. 



Fructose 2,6-bisphosphate 
(stimulates PFK) 


PFK 


Fructose 6-phosphate 


470 


FIGURE 16.20 Control of the synthesis 
and degradation of fructose 2,6-bís- 
phosphate. A low blood-glucose levei as 
signaled by glucagon leads to the phos- 
phorylation of the bifunctional enzyme 
and hence to a lower levei of fructose 
2,6-bisphosphate, slowing glycolysis. High 
leveis of fructose 6-phosphate accelerate 
the formation of fructose 2,6-bisphosphate 
by facilitating the dephosphorylation of the 
bifunctional enzyme. 


GLUCOSE ABUNDANT 

(glycolysis active) 


is essentially a phosphatase. In the bifunctional enzyme, 
the phosphatase activity evolved to become specific for 
F-2,6-BP. The bifunctional enzyme itself probably arose 
by the fusion of genes encoding the kinase and phos¬ 
phatase domains. 

The bifunctional enzyme exists in five isozymic forms 
(isoforms) that differ in size and kinetics as well as im- 
munological and regulatory properties. Recall that isoen- 
zymes, or isozymes, have essentially the same architec- 
tural plan and catalytic properties but differ in how they 
are regulated. The L isoform, which predominates in the 
liver, and the M isoform, found in muscle are generated 
by alternative splicing (Section 28.x.x) of the transcrip- 
tion product of a single gene. The L isoform helps to 
maintain blood-glucose homeostasis. In the liver, the 
concentration of fructose 6-phosphate rises when blood-glucose concentra- 
tion is high, and the abundance of fructose 6-phosphate accelerates the syn¬ 
thesis of F-2,6-BP. Hence, an abundance of fructose 6-phosphate leads to a 
higher concentration of F-2,6-BP, which in turn stimulates phosphofructoki- 
nase. Such a process is cslledfeedforward stimulation. What Controls whether 
PFK2 or FBPase2 dominates the bifunctional enzyme s activities in the 
liver? The activities of PFK2 and FBPase2 are reciprocally controlled by 
phosphorylation of a single serine residue. When glucose is scarce, a rise in 
the blood levei of the hormone glucagon triggers a cyclic AMP cascade, 
through its 7TM receptor and Ggn (Section IS.x.x.), leading to the phos¬ 
phorylation of this bifunctional enzyme by protein kinase A (Figure 16.20). 
This covalent modification activates FBPase2 and inhibits PFK2, lowering 
the levei of F-2,6-BP. Thus, glucose metabolism by the liver is curtailed. 
Conversely, when glucose is abundant, the enzyme loses its attached phos- 
phate group. This covalent modification activates PFK2 and inhibits FB- 
Pase2, raising the levei of F-2,6-BP and accelerating glycolysis. This coor- 
dinated control is facilitated by the location of the kinase and phosphatase 
domains on the same polypeptide chain as the regulatory domain. We shall 
return to this elegant switch when we consider the integration of carbohy- 
drate metabolism (Section 16.4). 
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16.2.3 Hexokinase and Pyruvate kinase 
Aiso Set the Pace of Glycolysis 

Phosphofructokinase is the most prominent regulatory enzyme in glycoly¬ 
sis, but it is not the only one. Hexokinase, the enzyme catalyzing the first 
step of glycolysis, is inhibited by its product, glucose 6-phosphate. High 
concentrations of this molecule signal that the cell no longer requires glu¬ 
cose for energy, for storage in the form of glycogen, or as a source of biosyn- 
thetic precursors, and the glucose will be left in the blood. For example, 
when phosphofructokinase is inactive, the concentration of fructose 6-phos¬ 
phate rises. In turn, the levei of glucose 6-phosphate rises because it is in 
equilibrium with fructose 6-phosphate. Hence, the inhibition of phospho¬ 
fructokinase leads to the inhibition of hexokinase. However, the liver, in keep- 
ing with its role as monitor of blood-glucose leveis, possesses a specialized 
isozyme of hexokinase called glucokinase that is not inhibited by glucose 
6 -phosphate. Glucokinase phosphorylates glucose only when it is abundant 
because it has about a 50-fold affinity for glucose than does hexokinase 
(5 mM, compared with 0.1 mM). The role of glucokinase is to provide glu¬ 
cose 6-phosphate for the synthesis of glycogen, a storage form of glucose 
(Section 21.x.x), and for the formation of fatty acids (Section 22.x.x). The 
low glucose affinity of glucokinase in the liver gives the brain and muscles 
first call on glucose when its supply is limited, whereas it ensures that glu¬ 
cose will not be wasted when it is abundant. 

Why is phosphofructokinase rather than hexokinase the pacemaker of 
glycolysis? The reason becomes evident on noting that glucose 6-phosphate 
is not solely a glycolytic intermediate. Glucose 6-phosphate can also be con- 
verted into glycogen or it can be oxidized by the pentose phosphate path- 
way (Section 17.x.x) to form NADPH. The first irreversible reaction unique 
to the glycolytic pathway, the committed step, (Section 10.2), is the phos- 
phorylation of fructose 6-phosphate to fructose 1,6-bisphosphate. Thus, it 
is highly appropriate for phosphofructokinase to be the primary control site 
in glycolysis. In general, the enzyme catalyzing the committed step in a meta- 
bolic sequence is the most important control element in the pathway. 

Pyruvate kinase, the enzyme catalyzing the third irreversible step in gly¬ 
colysis, Controls the outflow from this pathway. This final step yields ATP 
and pyruvate, a central metabolic intermediate that can be oxidized further 
or used as a building block. Several isozymic forms of pyruvate kinase (a 
tetramer of 57-kd subunits) encoded by different genes are present in mam- 
mals: the L type predominates in liver, and the M type in muscle and brain. 
The L and M forms of pyruvate kinase have many properties in common. 
Both bind phosphoenolpyruvate cooperatively. Fructose 1,6-bisphosphate, 
the product of the preceding irreversible step in glycolysis, acti vates both 
isozymes to enable them to keep pace with the oncoming high flux of inter- 
mediates. ATP allosterically inhibits both the L and the M forms of pyru¬ 
vate kinase to slow glycolysis when the energy charge is high. Finally, ala- 
nine (synthesized in one step from pyruvate, Section 24.x.x) also 
allosterically inhibits the pyruvate kinases—in this case, to signal that build¬ 
ing blocks are abundant. 

The isozymic forms differ in their susceptibility to covalent modifica- 
tion. The catalytic properties of the L form—but not of the M form—are 
also controlled by reversible phosphorylation (Figure 16.21). When the 
blood-glucose levei is low, the glucagon-triggered cyclic AMP cascade (Sec¬ 
tion IS.x.x) leads to the phosphorylation of pyruvate kinase, which dimin- 
ishes its activity. These hormone-triggered phosphorylations, like that of the 
bifunctional enzyme controlling the leveis of fructose 2,6-bisphosphate, pre- 
vent the liver from consuming glucose when it is more urgently needed by brain 
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FIGURE 16.21 Control of the catalytic 
activity of pyruvate kínase. Pyruvate 
kinase is regulated by allosteric effectors 
and covalent modification. 
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anà muscle (Section 30.x.x). We see here a clear-cut example of how isoen- 
zymes contribute to the metabolic diversity of different organs. We will re- 
turn to the control of glycolysis after considering gluconeogenesis. 

16.2.4 A Family of Transporters Enables Glucose 
to Enter and Leave Animal Cells 

Several glucose transporters mediate the thermodynamically downhill move- 
ment of glucose across the plasma membranes of animal cells. Each mem- 
ber of this protein family, named GLUTl to GLUT5, consists of a single 
polypeptide chain about 500 residues long (Table 16.4). The common struc- 
tural theme is the presence of 12 transmembrane segments (Figure 16.22). 
The members of this family have distinctive roles: 

1 . GLUTl and GLUT3, present in nearly all mammalian cells, are re- 
sponsible for basal glucose uptake. Their value for glucose is about 
1 mM, significantly less than the normal serum-glucose levei, which typi- 
cally ranges from 4 mM to 8 mM. Hence, GLUTl and GLUT3 continu- 
ally transport glucose into cells at an essentially constant rate. 

2 . GLUT2, present in liver and pancreatic j8 cells, is distinctive in having 
a very high value for glucose (15-20 mM). Hence, glucose enters these 
tissues at a biologically significant rate only when there is much glucose in 
the blood. The pancreas can thereby sense the glucose levei and accordingly 
adjust the rate of insulin secretion. Insulin signals the need to remove glu¬ 
cose from the blood for storage as glycogen or conversion into fat (Section 
30.x.x). The high value of GLUT2 also ensures that glucose rapidly 
enters liver cells only in times of plenty. 


FIGURE 16.22 Model of a mammalian 
glucose transporter. The hydrophobicity 
profile of the protein indicates 12 trans¬ 
membrane a helices. [From M. Muekier, C. 
Caruso, S. A. Baldwin, M. Pânico, M. Blench, H. 

R. Morris, W. J. Allard, G. E. Lienhard, and H. F. 
Lodish. Science 229(1985);941.] 
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TABLE 16.4 Family of glucose transporters 


Name 

Tissue location 


Comments 

GLUTl 

All mammalian 
tissues 

1 mM 

Basal glucose uptake 

GLUT2 

Liver and pancreatic 
/3 cells 

15—20 mM 

In the pancreas, plays 
a role in regulation 
of insulin 

In the liver, removes 
excess glucose from the 
blood 

GLUT3 

All mammalian 
tissues 

1 mM 

Basal glucose uptake 

GLUT4 

Muscle and fat cells 

5mM 

Amount in muscle 
plasma membrane 
increases with 
endurance training 

GLUT5 

Small intestine 


Primarily a fructose 
transporter 
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3. GLUT4, which has a Km value of 5 mM, transports glucose into mus- 
cle and fat cells. The presence of insulin, which signals the fed State, leads 
to a rapid increase in the number of GLUT4 transporters in the plasma 
membrane. Hence, insulin promotes the uptake of glucose by muscle and 
fat. The amount of this transporter present in muscle membranes increases 
in response to endurance exercise training. 

4. GLUT5, present in the small intestine, functions primarily as a fructose 
transporter. 

r This family of transporters vividly illustrates how isoforms of a sin¬ 
gle protein can significantly shape the metabolic character of cells 
and contribute to their diversity and functional specialization. The trans¬ 
porters are members of a superfamily of transporters called the major facil- 
itator (MF) superfamily. Members of this family transport sugars in or- 
ganisms as diverse as E. coli, Trypansoma brucei (a parasitic protozoan that 
causes sleeping sickness), and human beings. An elegant solution to the 
problem of fuel transport evolved early and has been tailored to meet the 
needs of different organisms and even different tissues. 


16.2.5 Câncer and Glycolysis 

T It has been known for decades that tumors display enhanced rates of 
glucose uptake and glycolysis. We now know that these enhanced 
rates of glucose processing are not fundamental to the development of cân¬ 
cer, but we can ask what selective advantage they might confer on câncer cells. 

Câncer cells grow more rapidly than the blood vessels to nourish them; 
thus, as solid tumors grow, they are unable to obtain oxygen efficiently. In 
other words, they begin to experience hypoxia. Under these conditions, gly¬ 
colysis leading to lactic acid fermentation becomes the primary source of 
ATP. Glycolysis is made more efficient in hypoxic tumors by the action of 
a transcription factor, hypoxia-inducible transcription factor (HIF-1). In the 
absence of oxygen, HIF-1 increases the expression of most glycolytic en- 
zymes and the glucose transporters GLUTl and GLUT3 (Table 16.5). In 


TABLE 16.5 Proteins in glucose 
metabolism encoded by genes 
regulated by hypoxia-inducible 
factor 

GLUTl 

GLUT3 

Hexokinase 

Phosphofructokinase 

Aldolase 

Glyceraldehyde 3-phosphate 
dehydrogenase 
Phosphoglycerate kinase 
Enolase 

Pyruvate kinase 
Lactate dehydrogenase 














Hypoxia 



FIGURE 16.23 Alteration of gene 
expression in tumors due to hypoxia. 

The hypoxic conditions inside a tumor 
mass lead to the activation of the hypoxia- 
inducible transcription factor (HIF-1), 
which induces metabolic adaptation 
(increase in glycolytic enzymes) and 
activates angiogenic factors that stimulate 
the growth of new blood vessels. [Adapted 
from C. V. Dang and G. L. Semenza. Trends 
Biochem. Sei. 24(1999):68-72.] 


fact, glucose uptake correlates with tumor aggressiveness and a poor prog- 
nosis. These adaptations by the câncer cells enable the tumor to survive un- 
til vascularization can oceur. HIF-1 also stimulates the growth of new tu¬ 
mors by increasing the expression of signal molecules, such as vascular 
endothelial growth factor (VEGF), that facilitate the growth of blood ves¬ 
sels (Figure 16.23). Without such vascularization, the tumor would cease to 
grow and either die or remain harmlessly small. Efforts are underway to de- 
velop drugs that inhibit vascularization of tumors. 


16.3 GLUCOSE CAN BE SYNTHESIZED 
FROM NONCARBOHYDRATE PRECURSORS 

We now turn to the synthesis of glucose from noncarbohydrate precursors, a 
process called gluconeogenesis. This metabolic pathway is important because 
the brain depends on glucose as its primary fuel and red blood cells use only 
glucose as a fuel. The daily glucose requirement of the brain in a typical 
adult human being is about 120 g, which aceounts for most of the 160 g of 
glucose needed daily by the whole body. The amount of glucose present in 
body fluids is about 20 g, and that readily available from glycogen, a stor- 
age form of glucose (Section 21.x.x), is approximately 190 g. Thus, the di- 
rect glucose reserves are sufficient to meet glucose needs for about a day. 
During a longer period of starvation, glucose must be formed from noncar¬ 
bohydrate sources (Section 30.x.x). 

The gluconeogenic pathway converts pyruvate into glucose. Noncarbohy¬ 
drate precursors of glucose are first converted into pyruvate or enter the 
pathway at later intermediates such as oxaloacetate and dihydroxyacetone 
phosphate (Figure 16.24). The major noncarbohydrate precursors are lact- 
ate, amino acids, and glycerol. Lactate is formed by active skeletal muscle 
when the rate of glycolysis exceeds the rate of oxidative metabolism. Lac¬ 
tate is readily converted into pyruvate by the action of lactate dehydrog- 
enase (Section 16.1.9). Amino acids are derived from proteins in the diet 
and, during starvation, from the breakdown of proteins in skeletal muscle 
(Section 30.x.x). The hydrolysis of triacylglycerols (Section 22.x.x) in fat 
cells yields glycerol and fatty acids. Glycerol is a precursor of glucose, but 
animais cannot convert fatty acids into glucose, for reasons that will be dis- 
cussed later (Section IS.x.x). Glycerol may enter either the gluconeogenic 
or the glycolytic pathway at dihydroxyacetone phosphate. 
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The major site of gluconeogenesis is the liver, with a small amount also 
taking place in the kidney. Little gluconeogenesis takes place in the brain, 
skeletal muscle, or heart muscle. Rather, gluconeogenesis in the liver and kid¬ 
ney helps to maintain the glucose levei in the blood so that brain and muscle 
can extract sufficient glucose from it to meet their metabolic demands. 

16.3.1 Gluconeogenesis Is Not a Reversal of Glycolysis 

In glycolysis, glucose is converted into pyruvate; in gluconeogenesis, pyru¬ 
vate is converted into glucose. However, gluconeogenesis is not a reversal of 
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FIGURE 16.24 Pathway of gluconeo- 
genesís. The distinctive reactions and 
enzymes of this pathway are shown in 
red. The other reactions are common to 
glycolysis. The enzymes for gluconeo- 
genesis are located in the cytosol, 
except for pyruvate carboxylase (in the 
mitochondria) and glucose 6-phosphatase 
(membrane bound in the endoplasmic 
reticuium). The entry points for lactate, 
glycerol, and amino acids are shown. 
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glycolysis. Several reactions must differ because the equilibrium of glycoly¬ 
sis lies far on the side of pyruvate formation. The actual AG for the forma- 
tion of pyruvate from glucose is about —20 kcal moP^ ( —84 kj moP^) un- 
der typical cellular conditions. Most of the decrease in free energy in 
glycolysis takes place in the three essentially irreversible steps catalyzed by 
hexokinase, phosphofructokinase, and pyruvate kinase. 

Glucose + ATP > glucose 6 -phosphate + ADP 

AG = — 8.0 kcal moP^ ( — 33 kJ moP^) 


T-, , 1 . I A Phosphofructokinase 

rructose 6 -phosphate + Al P--> 

fructose 1 , 6 -bisphosphate + ADP 
AG = —5.3 kcal moP^ ( — 22 kJ moP^) 


Phosphoenolpyruvate + ADP > pyruvate + ATP 

AG = —4.0 kcal moD^ ( — 17 kJ moD^) 

In gluconeogenesis, the following new steps bypass these virtually irre¬ 
versible reactions of glycolysis: 

1 . Phosphoenolpyruvate is formed from pyruvate by way of oxaloacetate 
through the action of pyruvate carboxylase and phosphoenolpyruvate 
carboxy kinase. 

Pyruvate + CO 2 + ATP + H 2 O carboxylase ^ 

oxaloacetate + ADP + Pi + 2 


Phosphoenolpyruvate carboxykinase 

Oxaloacetate + GTP-> 

phosphoenolpyruvate +GDP + CO 2 

2. Fructose 6-phosphate is formed from fructose 1,6-bisphosphate by hydro- 
lysis of the phosphate ester at carbon I . Fructose 1,6-bisphosphatase catalyzes 
this exergonic hydrolysis. 

Fructose 1,6-bisphosphate + H 2 O-> fructose 6-phosphate + Pi 

3. Glucose is formed by hydrolysis of glucose 6-phosphate in a reaction cat¬ 
alyzed by glucose 6-phosphatase. 

Glucose 6-phosphate + H 2 O-> glucose + Pi 

We will examine each of these steps in turn. 

16.3.2 The Conversion of Pyruvate into Phosphoenolpyruvate 
Begins with the Formation of Oxaloacetate 

The first step in gluconeogenesis is the carboxylation of pyruvate to form 
oxaloacetate at the expense of a molecule of ATP. Then, oxaloacetate is de- 
carboxylated and phosphorylated to yield phosphoenolpyruvate, at the ex¬ 
pense of the high phosphoryl-transfer potential of GTP. Both of these re¬ 
actions take place inside the mitochondria. 

Pyruvate + CO 2 + ATP + H 2 O v oxaloacetate + ADP + Pi + 2 
Oxaloacetate + GTP v phosphoenolpyruvate + GDP + CO 2 

The first reaction is catalyzed by pyruvate carboxylase and the second by 
phosphoenolpyruvate carboxykinase. The sum of these reactions is: 
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FIGURE 16.25 Domain structure of pyruvate carboxylase. The ATP-grasp domain 
activates HCOs" and transfers CO 2 to the biotin-binding domain. From there, the CO 2 is 
transferred to pyruvate generated in the central domain. 


Glucose Synthesis 


Pyruvate + ATP + GTP + H 2 O y 

phosphoenolpyruvate + ADP + GDP + Pi + 2 

Pyruvate carboxylase is of special interest because of its structural, catalytic, 
and allosteric properties. The N-terminal 300 to 350 amino acids form an 
ATP-grasp domain (Figure 16.25), which is a widely used ATP-activating 
domain to be discussed in more detail when we investigate nucleotide 
biosynthesis (Section 25.x.x). The C-terminal 80 amino acids constitute a 
biotin-binding domain (Figure 16.26) that we will see again in fatty acid 
synthesis (Section 22.x.x). Biotin is a covalently attached prosthetic group, 
which serves as a carrier of activated CO 2 . The carboxylate group of biotin 
is linked to the e- amino group of a specific lysine residue by an amide bond 
(Figure 16.27). Note that biotin is attached to pyruvate carboxylase by a 
long, flexible chain. 

The carboxylation of pyruvate takes place in three stages: 

HCOs” + ATP HOCOa-POs^” + ADP 
Biotin-enzyme + H 0 C 02 -P 03 ^ ^ C02-biotin-enzyme + Pi 

C02-biotin-enzyme + pyruvate y biotin-enzyme + oxaloacetate 

Recall that, in aqueous Solutions, CO 2 exists as HCOs” with the aid of 
carbonic anhydrase (Section 9.2). The HC 03 ~ is activated to carboxyphos- 
phate. This activated CO 2 is subsequently bonded to the N-1 atom of the 
biotin ring to form the carboxybiotin-enzyme intermediate (see Figure 16.27). 
The CO 2 attached to the biotin is quite activated. The Z\G°' for its cleavage 

CO 2 -biotin-enzyme + -> CO 2 + biotin-enzyme 

is —4.7 kcal moP^ ( —20 kj moP^). This negative2\G°' indicates that car- 
boxybiotin is able to transfer CO 2 to acceptors without the input of addi- 
tional free energy. 



FIGURE 16.26 Biotin-binding domain of pyruvate carboxylase. This likely 
structure is based on the structure of the homologous domain from the enzyme acetyl 
CoA carboxylase (Section 22.x.x). The biotin is on a flexible tether, allowing it to move 
between the ATP-bicarbonate site and the pyruvate site. 



FIGURE 16.27 Structure of 
carboxybiotin. 
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^ Oxaloacetate 


FIGURE 16.28 Compartmental 
cooperation. Oxaloacetate utilized in the 
cytosol for gluconeogenesis is formed in 
the mitochondrial matrix by carboxylation 
of pyruvate. Oxaloacetate leaves the 
mitochondrion by a specific transport 
System (not shown) in the form of malate, 
which is reoxidized to oxaloacetate in 
the cytosol. 


The activated carboxyl group is then transferred from carboxybiotin to 
pyruvate to form oxaloacetate. The long, flexible link between biotin and 
the enzyme enables this prosthetic group to rotate from one active site of 
the enzyme (the ATP-bicarbonate site) to the other (the pyruvate site). 

The first partial reaction of pyruvate carboxylase, the formation of car¬ 
boxybiotin, depends on the presence of acetyl CoA. Biotin is not carboxy- 
lated unless acetyl CoA is bound to the enzyme. Acetyl CoA has no effect on 
the second partial reaction. The allosteric activation of pyruvate carboxy¬ 
lase by acetyl CoA is an important physiological control mechanism that 
will be discussed in Section IS.x.x. 


16.3.3 Oxaloacetate Is Shuttled into the Cytosol and 
Converted into Phosphoenolpyruvate 

Pyruvate carboxylase is a mitochondrial enzyme, whereas the other enzymes 
of gluconeogenesis are cytoplasmic. Oxaloacetate, the product of the pyru¬ 
vate carboxylase reaction, is reduced to malate inside the mitochondrion for 
transport to the cytosol. The reduction is accomplished by an NADH- 
linked malate dehydrogenase. When malate has been transported across the 
mitochondrial membrane, it is reoxidized to oxaloacetate by an NAD^- 
linked malate dehydrogenase in the cytosol (Figure 16.28). 

Finally, oxaloacetate is simultaneously decarboxylated diVid phosphorylated 
by phosphoenolpyruvate carboxykinase in the cytosol. The CO 2 that was 
added to pyruvate by pyruvate carboxylase comes off in this step. Recall 
that, in glycolysis, the presence of a phosphoryl group traps the unstable 
enol isomer of pyruvate as phosphoenolpyruvate (Section 16.1.7). In glu¬ 
coneogenesis, the formation of the unstable enol is driven by decarboxyla- 
tion —the oxidation of the carboxylic acid to CO 2 — and trapped by the ad- 
dition of a phosphate to carbon 2 from GTP. The two-step pathway for the 
formation of phosphoenolpyruvate from pyruvate has a AG°' of + 0.2 kcal 
moP^ ( + 0.13 kj moP^) in contrast with +7.5 kcal moP^ ( + 31 kj moP^) 
for the reaction catalyzed by pyruvate kinase. The much more favorable 
AG°' for the two-step pathway results from the use of a molecule of ATP 
to add a molecule of CO 2 in the carboxylation step that can be removed to 
power the formation of phosphoenolpyruvate in the decarboxylation step. 
Decarboxylations often drive reactions otherwise highly endergonic. This meta- 
bolic motif is used in the citric acid cycle (Section 1 S.x.x), the pentose phos¬ 
phate pathway (Section 17.x.x), and fatty acid synthesis (Section 22.x.x). 

16.3.4 The Conversion of Fructose 1,6-bisphosphate into 
Fructose 6-phosphate and Pi Is an Irreversible Step 

On formation, phosphoenolpyruvate is metabolized by the enzymes of gly¬ 
colysis but in the reverse direction. These reactions are near equilibrium un- 
der intracellular conditions; so, when conditions favor gluconeogenesis, the 
reverse reactions will take place until the next irreversible step is reached. 
This step is the hydrolysis of fructose 1,6-bisphosphate to fructose 6-phos¬ 
phate and Pi. 

^ H ^ 1 • 1 1 I TT Fructose 1,6-bisphosphatase 

rructose 1,6-bisphosphate + 1^20 -> 

fructose 6-phosphate + Pi 

The enzyme responsible for this step is fructose 1,6-bisphosphatase. 
Like its glycolytic counterpart, it is an allosteric enzyme that participates in 
the regulation of gluconeogenesis. We will return to its regulatory proper- 
ties later in the chapter. 








































































16.3.5 The Generation of Free Glucose Is an 
Important Control Point 

The fructose 6 -phosphate generated by fructose 1, 6 -bisphosphatase is read- 
ily converted into glucose 6 -phosphate. In most tissues, gluconeogenesis 
ends here. Free glucose is not generated; rather, the glucose 6 -phosphate is 
processed in some other fashion, notably to form glycogen. One advantage 
to ending gluconeogenesis at glucose 6 -phosphate is that, unlike free glu¬ 
cose, the molecule cannot diffuse out of the cell. To keep glucose inside the 
cell, the generation of free glucose is controlled in two ways. First, the en- 
zyme responsible for the conversion of glucose 6 -phosphate into glucose, 
glucose 6-phosphatase, is regulated. Second, the enzyme is present only in 
tissues whose metabolic duty is to maintain blood-glucose homeostasis— 
tissues that release glucose into the blood. These tissues are the liver and to 
a lesser extent the kidney. 

This final step in the generation of glucose does not take place in the cy- 
tosol. Rather, glucose 6 -phosphate is transported into the lumen of the en- 
doplasmic reticulum, where it is hydrolyzed to glucose by glucose 6 -phos¬ 
phatase, which is bound to the membrane (Figure 16.29). An associated 
Ca^^-binding stabilizing protein is essential for phosphatase activity. Glu¬ 
cose and Pi are then shuttled back to the cytosol by a pair of transporters. 
The glucose transporter in the endoplasmic reticulum membrane is like 
those found in the plasma membrane (Section 16.2.4). It is striking that five 
proteins are needed to transform cytosolic glucose 6 -phosphate into glucose. 
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FIGURE 16.29 Generation of glucose 
from glucose 6-phosphate. Several en¬ 
doplasmic reticulum (ER) proteins play a 
role in the generation of glucose from glu¬ 
cose 6-phosphate. TI transports glucose 
6-phosphate into the lumen of the ER, 
whereas T2 and T3 transport Pj and glu¬ 
cose, respectively, back into the cytosol. 
Glucose 6-phosphatase is stabilized by a 
Ca^^-binding protein (SP). [After A. Buchell 
and I. D. Waddel. Biochem. Biophys. Acta 
1092(1991):129.] 


16.3.6 Six High-Energy Phosphate Bonds Are Spent 
in Synthesizing Glucose from Pyruvate 

The stoichiometry of gluconeogenesis is: 

2 Pyruvate + 4 ATP + 2 GTP + 2 NADH + 6 H 2 O-> 

glucose + 4 ADP + 2 GDP + 6 Pi + 2 NAD^ + 2 

AG = —9 kcal moP^ ( — 38 kj moP^) 

In contrast, the stoichiometry for the rever sal of glycolysis is: 

2 Pyruvate + 2 ATP + 2 NADH + 2 H 2 O-> 

glucose + 2 ADP + 2 Pi + 2 NAD^ 
AG = +20 kcal moP^ ( + 84 kJ moP^) 

Note that six nucleotide triphosphate molecules are hydrolyzed to synthe- 
size glucose from pyruvate in gluconeogenesis, whereas only two molecules 
of ATP are generated in glycolysis in the conversion of glucose into pyru¬ 
vate. Thus, the extra cost of gluconeogenesis is four high phosphoryl-trans- 
fer potential molecules per molecule of glucose synthesized from pyruvate. 
The four additional high phosphoryl-transfer potential molecules are 
needed to turn an energetically unfavorable process (the reversal of gly¬ 
colysis, AG°' = + 20 kcal mol~^ [ + 84 kJ moP^]) into a favorable one 
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(gluconeogenesis, AG°' = —9 kcal moP^ [ — 38 kj moP^]). This is a clear 
example of the coupling of reactions: ATP hydrolysis is used to power an 
energetically unfavorable reaction. 




CONCEPTUAL INSIGHTS, Energetics of Glucose Metabolism. See the section 
on gluconeogenesis in the Conceptual Insights module to review why and how gluconeo¬ 
genesis must differ from the reversa! of glycolysis. 


16.4 GLUCONEOGENESIS AND GLYCOLYSIS ARE 
RECIPROCALLY REGULATED 

Gluconeogenesis and glycolysis are coordinated so that within a cell one 
pathway is relatively inactive while the other is highly active. If both sets of 
reactions were highly active at the same time, the net result would be the 
hydrolysis of four nucleotide triphosphates (two ATP plus two GTP) per 
reaction cycle. Both glycolysis and gluconeogenesis are highly exergonic un- 
der cellular conditions, and so there is no thermodynamic barrier to such 
simultaneous activity. However, the amounts and activities of the distinctive 
enzymes of each pathway are controlled so that both pathways are not highly 
active at the same time. The rate of glycolysis is also determined by the con- 
centration of glucose, and the rate of gluconeogenesis by the concentrations 
of lactate and other precursors of glucose. 

The interconversion of fructose 6-phosphate and fructose 1,6-bisphos- 
phate is stringently controlled (Figure 16.30). As discussed in Section 
16.2.1, AMP stimulates phosphofructokinase, whereas ATP and citrate in- 
hibit it. Fructose 1,6-bisphosphatase, on the other hand, is inhibited by 
AMP and activated by citrate. A high levei of AMP indicates that the en- 
ergy charge is low and signals the need for ATP generation. Conversely, 


FIGURE 16.30 Reciprocai regulation of 
gluconeogenesis and glycolysis in the 
liver. The levei of fructose 2,6-bisphosphate 
is high in the fed State and low in starva- 
tion. Another important control is the inhi- 
bition of pyruvate kinase by phosphoryla- 
tion during starvation. 
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high leveis of ATP and citrate indicate that the energy charge is high and 
that biosynthetic intermediates are abundant. Under these conditions, gly- 
colysis is nearly switched off and gluconeogenesis is promoted. 

Phosphofructokinase and fructose 1,6-bisphosphatase are also recipro- 
cally controlled hy fructose 2,6-bisphosphate in the liver (Section 16.2.2). The 
levei of F-2,6-BP is low during starvation and high in the fed state, because 
of the antagonistic effects of glucagon and insulin on the production and 
degradation of this signal molecule. Fructose 2,6-bisphosphate strongly stim- 
ulates phosphofructokinase and inhibits fructose 1,6-bisphosphatase. Hence, 
glycolysis is accelerated and gluconeogenesis is diminished in the fed state. 
During starvation, gluconeogenesis predominates because the levei of 
F-2,6-BP is very low. Glucose formed by the liver under these conditions 
is essential for the viability of brain and muscle. 

The interconversion of phosphoenolpyruvate and pyruvate also is pre- 
cisely regulated. Recall that pyruvate kinase is controlled by allosteric ef- 
fectors and by phosphorylation (Section 16.2.3). High leveis of ATP and 
alanine, which signal that the energy charge is high and that building blocks 
are abundant, inhibit the enzyme in liver. Conversely, pyruvate carboxy- 
lase, which catalyzes the first step in gluconeogenesis from pyruvate, is ac- 
tivated by acetyl Co A and inhibited by ADP. Likewise, ADP inhibits phos¬ 
phoenolpyruvate carboxykinase. Hence, gluconeogenesis is favored when 
the cell is rich in biosynthetic precursors and ATP. 

The amounts and the activities of these essential enzymes also are reg¬ 
ulated. The regulators in this case are hormones. Hormones affect gene ex- 
pression primarily by changing the rate of transcription, as well as by reg- 
ulating the degradation of mRNA. Insulin, which rises subsequent to 
eating, stimulates the expression of phosphofructokinase, pyruvate kinase, 
and the bifunctional enzyme that makes and degrades F-2,6-BP. Glucagon, 
which rises during starvation, inhibits the expression of these enzymes and 
stimulates instead the production of two key gluconeogenic enzymes, phos¬ 
phoenolpyruvate carboxykinase and fructose 1,6-bisphosphatase. Tran- 
scriptional control in eukaryotes is much slower than allosteric control; it 
takes hours or days in contrast with seconds to minutes. The richness and 
complexity of hormonal control are graphically displayed by the promoter 
of the phosphoenolpyruvate carboxykinase gene, which contains regulatory 
sequences that respond to insulin, glucagon, glucocorticoids, and thyroid 
hormone (Figure 16.31). 

-500 1 


TRE CREI CREII 

GRE 

FIGURE 16.31 The promoter of the phosphoenolpyruvate carboxykinase gene. This 
promoter is approximately 500 bp in length and contains regulatory sequences (response 
elements) that mediate the action of several hormones. IRE, insulin response element; GRE, 
glucocorticoid response element; TRE, thyroid hormone response element; CREI and CREII, 
cAMP response elements. [After M. M. McGrane, J. S Jun, Y. M. Patel, and R. W. Hanson. Trends 
Biochem. Sei. 17(1992);40.] 



16.4.1 Substrate Cycles Amplify Metabolic Signals 
and Produce Heat 

A pair of reactions such as the phosphorylation of fructose 6-phosphate to 
fructose 1,6-bisphosphate and its hydrolysis back to fructose 6-phosphate 
is called a substrate cycle. As already mentioned, both reactions are not si- 
multaneously fully active in most cells, because of reciprocai allosteric 
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ATP ADP 



Net flux of B = 10 


ATP ADP 



Net flux of B = 48 


FIGURE 16.32 Substrate cycle. This 
ATP-driven cycle operates at two different 
rates. A small change in the rates of the 
two opposing reactions resuits in a large 
change in the net flux of product B. 


Controls. However, the resuits of isotope-labeling studies have shown that 
some fructose 6-phosphate is phosphorylated to fructose 1,6-bisphosphate 
in gluconeogenesis. There also is a limited degree of cycling in other pairs 
of opposed irreversible reactions. This cycling was regarded as an imper- 
fection in metabolic control, and so substrate cycles have sometimes been 
called futile cycles. Indeed, there are pathological conditions, such as ma- 
lignant hyperthermia, in which control is lost and both pathways proceed 
rapidly with the concomitant generation of heat by the rapid, uncontrolled 
hydrolysis of ATP. 

Despite such extraordinary circumstances, it now seems likely that sub¬ 
strate cycles are biologically important. One possibility is that substrate cy¬ 
cles amplify metabolic signals. Suppose that the rate of conversion of A into 
B is 100 and of B into A is 90, giving an initial net flux of 10. Assume that 
an allosteric effector increases the A —> B rate by 20% to 120 and recipro- 
cally decreases the B —> A rate by 20% to 72. The new net flux is 48, and so 
a 20% change in the rates of the opposing reactions has led to a 380% increase 
in the net flux. In the example shown in Figure 16.32, this amplification is 
made possible by the rapid hydrolysis of ATP. It has been suggested that 
the flux down the glycolytic pathway may increase 1000-fold at the initia- 
tion of intense exercise. Because it seems unlikely that allosteric activation 
of enzymes alone could explain this increased flux, the existence of substrate 
cycles may partly account for the rapid rise in the rate of glycolysis. 

The other potential biological role of substrate cycles is the generation of 
heat produced by the hydrolysis of ATP. A striking example is provided by 
bumblebees, which must maintain a thoracic temperature of about 30°C to 
fly. A bumblebee is able to maintain this high thoracic temperature and for¬ 
age for food even when the ambient temperature is only 10 ° C because phos- 
phofructokinase and fructose 1,6-bisphosphatase in its flight muscle are si- 
multaneously highly active; the continuous hydrolysis of ATP generates 
heat. This bisphosphatase is not inhibited by AMP, which suggests that the 
enzyme is specially designed for the generation of heat. In contrast, the hon- 
eybee has almost no fructose 1,6-bisphosphatase activity in its flight mus¬ 
cle and consequently cannot fly when the ambient temperature is low. 

16.4.2 Lactate and Alanine Formed by Contracting Muscle 
Are Converted into Glucose by the Liver 

The major raw materiais of gluconeogenesis are lactate and alanine produced 
by active skeletal muscle and erythrocytes. Erythrocytes lack mitochondria 
and can never oxidize glucose completely. In contracting skeletal muscle 
during vigorous exercise, the rate at which glycolysis produces pyruvate ex- 
ceeds the rate at which the citric acid cycle oxidizes it. Under these condi¬ 
tions, moreover, the rate of formation of NADH by glycolysis is greater 
than the rate of its oxidation by aerobic metabolism. Continued glycolysis 
depends on the availability of NAD^ for the oxidation of glyceraldehyde 
3-phosphate. The accumulation of both NADH and pyruvate is reversed 
by lactate dehydrogenase, which oxidizes NADH to NAD^ as it reduces 
pyruvate to lactate (Section 16.1.7). However, lactate is a dead end in me¬ 
tabolism. It must be converted back into pyruvate before it can be metab- 
olized. The only purpose of the reduction of pyruvate to lactate is to regen- 
erate NAD^ so that glycolysis can proceed in active skeletal muscle and 
erythrocytes. The formation of lactate buys time and shifts part of the meta¬ 
bolic burdenfrom muscle to liver. 

The plasma membrane of most cells contains carriers that render them 
highly permeable to lactate and pyruvate. Both substances diffuse out of ac¬ 
tive skeletal muscle into the blood and are carried to the liver. Much more 
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FIGURE 16.33 The Corí 
cycle. Lactate formed by 
active muscle is converted 
into glucose by the liver. 
This cycle shifts part of 
the metabolic burden of 
active muscle to the liver. 


Reciprocai Regulation 


lactate than pyruvate is transported out because the high NADH/NAD^ ra- 
tio in contracting skeletal muscle favors the conversion of pyruvate into lac¬ 
tate. The lactate that enters the liver is oxidized to pyruvate, a reaction fa- 
vored by the low NADH/NAD^ ratio in the cytosol of liver cells. Pyruvate 
in the liver is converted into glucose by the gluconeogenic pathway. Glu¬ 
cose then enters the blood and is taken up by skeletal muscle. Thus, the liver 
furnishes glucose to contracting skeletal muscle, which derives ATP from the 
glycolytic conversion of glucose into lactate. Contracting skeletal muscle sup- 
plies lactate to the liver, which uses it to synthesize glucose. These reactions 
constitute the Cori cycle (Figure 16.33). Studies have shown that alanine, like 
lactate, is a major precursor of glucose. In muscle, alanine is formed from 
pyruvate by transamination (Section 24.x.x); the reverse reaction takes place 
in the liver. The interplay between glycolysis and gluconeogenesis is sum- 
marized in Figure 16.34, which shows how these two pathways help to meet 
the energy needs of different cell types. 

Y lsozymic forms of lactate dehydrogenase in different tissues catalyze 
the interconversions of pyruvate and lactate. Lactate dehydrogenase 
is a tetramer of two kinds of 35-kd subunits encoded by similar genes: the 
H type predominates in the heart, and the homologous M type in skeletal 
muscle and the liver. These subunits associate to form five types of tetramers: 
H 4 , H 3 M 1 , H 2 M 2 , H 1 M 3 , and M 4 . The H 4 isozyme (type 1) has higher 



FIGURE 16.34 Cooperation between 
glycolysis and gluconeogenesis. 

Glycolysis and gluconeogenesis are 
coordinated, in a tissue-specific fashion, to 
ensure that the glucose-dependent energy 
needs of all cells are met. 




























-1 16-36 1- 

CHAPTER 16 • Glycolysis and 
Gluconeogenesis 


affinity for substrates than does the M4 isozyme (type 5) and, unlike M4, is 
allosterically inhibited by high leveis of pyruvate. The other isozymes have 
intermediate properties, depending on the ratio of the two kinds of chains. 
H4 is designed to oxidize lactate to pyruvate, which is then utilized as a fuel 
by the heart through aerobic metabolism. Indeed, heart muscle never func- 
tions anaerobically. In contrast, M4 is optimized to operate in the reverse 
direction, to convert pyruvate into lactate to allow glycolysis to proceed un- 
der anaerobic conditions. We see here an example of how gene duplication 
and divergence generate a series of homologous enzymes that foster meta- 
bolic cooperation between organs. 

16.4.3 Glycolysis and Gluconeogenesis Are 
Evolutionarily Intertwined 

Y The metabolism of glucose has ancient origins. Organisms living in 
the early biosphere depended on the anaerobic generation of energy 
until significant amounts of oxygen began to accumulate 2 billion years ago. 
The fact that glycolytic enzymes with similar properties do not have simi¬ 
lar amino acid sequences also provides a clue to how the pathway originated. 
Although there are four kinases and two isomerases in the pathway, both 
sequence and structural comparisons do not suggest that these sets of en¬ 
zymes are related to one another by divergent evolution. The absence of 
such similarities implies that glycolytic enzymes were derived independ- 
ently rather than by gene duplication. The common dinucleotide-binding 
domain found in the dehydrogenases (Section 16.1.10) and the njS barreis 
are the only major recurring elements. 

We can speculate on the relationship between glycolysis and gluconeo¬ 
genesis if we think of glycolysis as consisting of two segments: the metab¬ 
olism of hexoses (the upper segment) and the metabolism of trioses (the 
lower segment). The enzymes from the upper segment are different in some 
species and are missing entirely in some archaea, whereas enzymes from the 
lower segment are quite conserved. In fact, four enzymes of the lower seg¬ 
ment are present in all species. This lower part of the pathway is common to 
glycolysis and gluconeogenesis. This common part of the two pathways 
may be the oldest part, constituting the core to which the other steps were 
added. The upper part would vary according to the sugars that were avail- 
able to evolving organisms in particular niches. Interestingly, this core part 
of carbohydrate metabolism can generate triose precursors for ribose sug¬ 
ars, a component of RN A and a criticai requirement for the RN A world. 
Thus, we are left with the unanswered question, Was the original core path¬ 
way used for energy conversion or biosynthesis? 


SUMMARY 


O Glycolysis Is an Energy-Conversion Pathway in Many Organisms 

Glycolysis is the set of reactions that converts glucose into pyruvate. 
The 10 reactions of glycolysis take place in the cytosol. In the first stage, 
glucose is converted into fructose 1,6-bisphosphate by a phosphoryla- 
tion, an isomerization, and a second phosphorylation reaction. Two 
molecules of ATP are consumed per molecule of glucose in these reac¬ 
tions, which are the prelude to the net synthesis of ATR In the second 
stage, fructose 1,6-bisphosphate is cleaved by aldolase into dihydroxy- 
acetone phosphate and glyceraldehyde 3-phosphate, which are readily 
interconvertible. In the third stage, ATP is generated. Glyceraldehyde 
3-phosphate is oxidized and phosphorylated to form 1,3-bisphospho- 






glycerate, an acyl phosphate with a high phosphoryl-transfer potential. 
This molecule transfers a phosphoryl group to ADP to form ATP and 
3 -phosphoglycerate. A phosphoryl shift and a dehydration form phos- 
phoenolpyruvate, a second intermediate with a high phosphoryl- 
transfer potential. Another molecule of ATP is generated as phospho- 
enolpyruvate is converted into pyruvate. There is a net gain of two mol- 
ecules of ATP in the formation of two molecules of pyruvate from one 
molecule of glucose. 

The electron acceptor in the oxidation of glyceraldehyde 3 -phos¬ 
phate is NAD^, which must be regenerated for glycolysis to continue. 
In aerobic organisms, the NADH formed in glycolysis transfers its elec- 
trons to O2 through the electron-transport chain, which thereby regen- 
erates NAD^. Under anaerobic conditions and in some microorgan- 
isms, NAD^ is regenerated by the reduction of pyruvate to lactate. In 
other microorganisms, NAD^ is regenerated by the reduction of pyru¬ 
vate to ethanol. These two processes are examples of fermentations. 

• The Glycolytic Pathway Is Tightly Controlled 

The glycolytic pathway has a dual role: it degrades glucose to generate 
ATP, and it provides building blocks for the synthesis of cellular com- 
ponents. The rate of conversion of glucose into pyruvate is regulated to 
meet these two major cellular needs. Under physiologic conditions, the 
reactions of glycolysis are readily reversible except for the ones catalyzed 
by hexokinase, phosphofructokinase, and pyruvate kinase. Phospho- 
fructokinase, the most important control element in glycolysis, is in- 
hibited by high leveis of ATP and citrate, and it is activated by AMP 
and fructose 2,6-bisphosphate. In the liver, this bisphosphate signals 
that glucose is abundant. Hence, phosphofructokinase is active when 
either energy or building blocks are needed. Hexokinase is inhibited by 
glucose 6-phosphate, which accumulates when phosphofructokinase is 
inactive. ATP and alanine allosterically inhibit pyruvate kinase, the 
other control site, and fructose 1,6-bisphosphate activates the enzyme. 
Consequently, pyruvate kinase is maximally active when the energy 
charge is low and glycolytic intermediates accumulate. 

• Glucose Can Be Synthesized from Noncarbohydrate Precursors 

Gluconeogenesis is the synthesis of glucose from noncarbohydrate 
sources, such as lactate, amino acids, and glycerol. Several of the reac¬ 
tions that convert pyruvate into glucose are common to glycolysis. Glu¬ 
coneogenesis, however, requires four new reactions to bypass the es- 
sential irreversibility of three reactions in glycolysis. In two of the new 
reactions, pyruvate is carboxylated in mitochondria to oxaloacetate, 
which in turn is decarboxylated and phosphorylated in the cytosol to 
phosphoenolpyruvate. Two high-energy phosphate bonds are con- 
sumed in these reactions, which are catalyzed by pyruvate carboxylase 
and phosphoenolpyruvate carboxykinase. Pyruvate carboxylase con- 
tains a biotin prosthetic group. The other distinctive reactions of glu¬ 
coneogenesis are the hydrolyses of fructose 1,6-bisphosphate and glu¬ 
cose 6-phosphate, which are catalyzed by specific phosphatases. The 
major raw materiais for gluconeogenesis by the liver are lactate and ala¬ 
nine produced from pyruvate by active skeletal muscle. The formation 
of lactate during intense muscular activity buys time and shifts part of 
the metabolic burden from muscle to the liver. 

• Gluconeogenesis and Glycolysis Are Reciprocally Regulated 

Gluconeogenesis and glycolysis are reciprocally regulated so that one 
pathway is relatively inactive while the other is highly active. Phos¬ 
phofructokinase and fructose 1,6-bisphosphatase are key control points. 
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Fructose 2,6-bisphosphate, an intracellular signal molecule present at 
higher leveis when glucose is abundant, activates glycolysis and inhibits 
gluconeogenesis by regulating these enzymes. Pyruvate kinase and 
pyruvate carboxylase are regulated by other effectors so that both are 
not maximally active at the same time. Allosteric regulation and re- 
versible phosphorylation, which are rapid, are complemented by tran- 
scriptional control, which takes place in hours or days. 
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- ^PROBLEMS I- 

1. Kitchen chemistry. Sucrose is commonly used to preserve 
fruits. Why is glucose not suitable for preserving foods? 

2. Tracing carbon atoms L Glucose labeled with at C-1 is 
incubated with the glycolytic enzymes and necessary cofactors. 

(a) What is the distribution of in the pyruvate that is 
formed? (Assume that the interconversion of glyceraldehyde 
3-phosphate and dihydroxyacetone phosphate is very rapid 
compared with the subsequent step.) 

(b) If the specific activity of the glucose substrate is 10 mCi 
mM~^, what is the specific activity of the pyruvate that is 
formed? 

3. Lactic acid fermentation. (a) Write a balanced equation for 
the conversion of glucose into lactate. 

(b) Calculate the standard free-energy change of this reaction 
by using the data given in Table 16-3 and the fact that AG°' is 
— 6 kcal for the foliowing reaction: 

Pyruvate + NADH + ^ lactate + NAD^ 

What is the free-energy change (AG, not AG°') of this reaction 
when the concentrations of reactants are: glucose, 5 mM; lactate, 
0.05 mM; ATP, 2 mM; ADR, 0.2 mM; and Pi, 1 mM? 

4. High potential. What is the equilibrium ratio of phospho- 
enolpyruvate to pyruvate under standard conditions when 
[ATP]/[ADR] = 10? 


5. Hexose-triose equilibrium. What are the equilibrium con¬ 
centrations of fruetose 1,6-bisphosphate, dihydroxyacetone 
phosphate, and glyceraldehyde 3-phosphate when 1 mM frue¬ 
tose 1,6-bisphosphate is incubated with aldolase under standard 
conditions? 

6. Double labeling. 3-Phosphoglycerate labeled uniformly with 
is incubated with 1,3-BPG labeled with ^^P at C-1. What is 

the radioisotope distribution of the 2,3-BPG that is formed on 
addition of BPG mutase? 

7. An informative analog. Xylose has the same strueture as that 
of glucose except that it has a hydrogen atom at C-6 in place of 
a hydroxymethyl group. The rate of ATP hydrolysis by hexo- 
kinase is markedly enhanced by the addition of xylose. Why? 

8. Distinctive sugars. The intravenous infusion of fruetose into 
healthy volunteers leads to a two- to fivefold increase in the levei 
of lactate in the blood, a far greater increase than that observed 
after the infusion of the same amount of glucose. 

(a) Why is glycolysis more rapid after the infusion of fruetose? 
(a) Fruetose has been used in place of glucose for intravenous 
feeding. Why is this use of fruetose unwise? 

9. Metabolic mutants. Predict the effect of each of the foliow- 
ing mutations on the pace of glycolysis in liver cells: 

(a) Loss of the allosteric site for ATP in phosphofructokinase. 

(b) Loss of the binding site for citrate in phosphofructokinase. 
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(c) Loss of the phosphatase domain of the bifunctional enzyme 
that Controls the levei of fructose 2,6-bisphosphate. 

(d) Loss of the binding site for fructose 1,6-bisphosphate in 
pyruvate kinase. 

10. Metabolic mutant. What are the likely consequences of a ge- 
netic disorder rendering fructose 1,6-bisphosphatase in liver less 
sensitive to regulation by fructose 2,6-bisphosphate? 

11. Biotin snatcher. Avidin, a 70-kd protein in egg white, has 
very high affinity for biotin. In fact, it is a highly specific in- 
hibitor of biotin enzymes. Which of the following conversions 
would be blocked by the addition of avidin to a cell homogenate? 

(a) Glucose-> pyruvate 

(b) Pyruvate-> glucose 

(c) Oxaloacetate-> glucose 

(d) Malate-> oxaloacetate 

(e) Pyruvate-> oxaloacetate 

(f) Glyceraldehyde 3-phosphate-> fructose 1,6-bisphosphate 

12. Tracing carbon atoms 11. If cells synthesizing glucose from 
lactate are exposed to CO 2 labeled with what will be the 
distribution of labei in the newly synthesized glucose? 

13. Arsenate poisoning. Arsenate (As 04 ^~) closely resembles Pi 
in structure and reactivity. In the reaction catalyzed by glycer¬ 
aldehyde 3-phosphate dehydrogenase, arsenate can replace 
phosphate in attacking the energy-rich thioester intermediate. 
The product of this reaction, l-arseno-3-phosphoglycerate, is 
unstable. It and other acyl arsenates are rapidly and sponta- 
neously hydrolyzed. What is the effect of arsenate on energy gen- 
eration in a cell? 

14. Reduce, reuse, recycle. In the conversion of glucose into two 
molecules of lactate, the NADH generated earlier in the path- 
way is oxidized to NAD^. Why is it not to the cells advantage 
to simply make more NAD^ so that the regeneration would not 
be necessary? After all, the cell would save much energy because 
it would no longer need to synthesize lactic acid dehydrogenase. 

15. Adenylate kinase again. Adenylate kinase, an enzyme dis- 
cussed in great detail in Chapter 9, is responsible for intercon- 
verting the adenylate nucleotide pool: 

* ^ ^ ^ ^ Adenylate kinase , 

ADP + ADP . . . 1 _ =± ATP + AMP 

The equilibrium constant for this reaction is dose to 1, inasmuch 
as the number of phosphoanhydride bonds is the same on each 
side of the equation. Using the equation for the equilibrium con¬ 
stant for this reaction, show why changes in [AMP] are a more 
effective indicator of the adenylate pool than [ATP]. 

16. Working at cross-purposes? Gluconeogenesis takes place dur- 
ing intense exercise, which seems counterintuitive. Why would 
an organism synthesize glucose and at the same time use glucose 
to generate energy? 

17. Powering pathways. Compare the stoichiometries of glycol¬ 
ysis and gluconeogenesis (Section 16.1.8). Recall that the input 
of one ATP equivalent changes the equilibrium constant of a re¬ 
action by a factor of about 10^ (Section 16.3.6). By what factor 
do the additional high phosphoryl-transfer potential compounds 
alter the equilibrium constant of gluconeogenesis? 


Mechanism Problem 

18. Argument by analogy. Propose a mechanism for the conver¬ 
sion of glucose 6-phosphate into fructose 6-phosphate by phos- 
phoglucose isomerase based on the mechanism of triose phos¬ 
phate isomerase. 

Chapter Integration Problem 

19. Not just for energy. People with galactosemia display central 
nervous system abnormalities even if galactose is eliminated 
from the diet. The precise reason for it is not known. Suggest a 
plausible explanation. 

Data Interpretation Problem 

20. Nou), that^s unusual. Phosphofructokinase has recently been 
isolated from the hyperthermophilic archaeon Pyrococcus furio- 
sus. It was subjected to standard biochemical analysis to deter¬ 
mine basic catalytic parameters. The processes under study were 
of the form: 

Fructose 6-phosphate + (x — Pi)-> 

fructose 1,6-bisphosphate + (x) 

The assay measured the increase in fructose 1,6-bisphosphate. 
Selected results are shown in the adjoining graph. 



[ADP], mM 

(a) How does the P. furiosus phosphofructokinase differ from 
the phosphofructokinase discussed in this chapter? 

(b) What effects do AMP and ATP have on the reaction with 
ADP? 

Media Problems 

21. No free lunch. Suppose a microorganism is discovered to be 
lacking pyruvate kinase. Nevertheless, the organism does me- 
tabolize glucose to pyruvate. Surprisingly, it does so with a net 
gain of four molecules of ATP (or ATP equivalents) per mole- 
cule of metabolized glucose. Looking at the Conceptual In- 
sights module on the energetics of glucose metabolism, suggest 
a biochemical pathway for metabolism of glucose in this organ¬ 
ism. Why might evolution have favored the normal glycolytic 
pathway rather than this alternative? 

22. Oxidation in the absence of oxygen. The energetics of glycol¬ 
ysis section of the Conceptual Insights module shows that 
there is a large free energy drop upon oxidation of glyceralde¬ 
hyde 3-phosphate to 1,3-bisphosphoglycerate (reaction 6). In the 
presence of oxygen, some to this energy is ultimately converted 
into ATP production. However, no such conversion happens 
under anaerobic conditions. Explain why. 















The Citric Acid Cycle 



Roundabouts, or traffic círcies, function as hubs to facilítate traffíc flow. 

The citric acid cycle is the biochemical hub of the cell, oxidizing carbon fuels, usually in 
the form of acetyl CoA, as well as serving as a source of precursors for biosynthesis. 
[(Above) Chris Warren/International Stock.] 



2CO2 


From Chapter 16 , we know that glucose can be metabolized to pyruvate 
anaerobically to synthesize ATP through the glycolytic pathway. Glycoly- 
sis, however, harvests but a fraction of the ATP available from glucose. We 
now begin an exploration of the aerobic processing of glucose, which is the 
source of most of the ATP generated in metabolism. The aerobic process¬ 
ing of glucose starts with the complete oxidation of glucose derivatives to 
carbon dioxide. This oxidation takes place in the citric acid cycle, a series of 
reactions also known as the tricarboxylic acid (TCA) cycle or the Krebs 
cycle. The citric acid cycle is the final common pathway for 
the oxidation offuel molecules —amino acids, fatty acids, and 
carbohydrates. Most fuel molecules enter the cycle as acetyl 
coenzyme A. 
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FIGURE 17.1 Mitochondríon. The double 
membrane of the mitochondrion is evident 
in this electron micrograph. The numerous 
invaginations of the inner mitochondrial 
membrane are called cristae. The oxidative 
decarboxylation of pyruvate and the 
sequence of reactions in the citric add 
cyde take place within the matrix. [(Left) 
Omikron/Photo Researchers.] 



FIGURE 17.2 OverView of the citric 
acid cycle. The dtric acid cyde oxidizes 
two-carbon units, producing two molecules 
of CO 2 , one molecule of GTP, and high- 
energy electrons in the form of NADH 
and FADH 2 . 


FIGURE 17.3 Cellular respiration. The 

citric acid cycle constitutes the first stage 
in cellular respiration, the removal of high- 
energy electrons from carbon fueis (left). 
These electrons reduce O 2 to generate a 
proton gradient (middie), which is used 
to synthesize ATP (right). The reduction 
of O 2 and the synthesis of ATP constitute 
oxidative phosphorylation. 
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Under aerobic conditions, the pyruvate generated from glucose is ox- 
idatively decarboxylated to form acetyl Co A. In eukaryotes, the reactions 
of the citric acid cycle take place inside mitochondria, in contrast with those 
of glycolysis, which take place in the cytosol (Figure 17.1). 

17.0.1 An OverView of the Citric Acid Cycle 

The citric acid cycle is the central metabolic hub of the cell. It is the gate- 
way to the aerobic metabolism of any molecule that can be transformed into 
an acetyl group or dicarboxylic acid. The cycle is also an important source 
of precursors, not only for the storage forms of fueis, but also for the build- 
ing blocks of many other molecules such as amino acids, nucleotide bases, 
cholesterol, and porphyrin (the organic component of heme). 

What is the function of the citric acid cycle in transforming fuel mole¬ 
cules into ATP? Recall that fuel molecules are carbon compounds that are 
capable of being oxidized—of losing electrons (Chapter 14). The citric acid 
cycle includes a series of oxidation-reduction reactions that result in the ox- 
idation of an acetyl group to two molecules of carbon dioxide. 

The overall pattern of the citric acid cycle is shown in Figure 17.2. A 
four- carbon compound (oxaloacetate) condenses with a two-carbon acetyl 
unit to yield a six-carbon tricarboxylic acid (citrate). An isomer of citrate is 
then oxidatively decarboxylated. The resulting five-carbon compound 
(a-ketoglutarate) also is oxidatively decarboxylated to yield a four-carbon 
compound (succinate). Oxaloacetate is then regenerated from succinate. 
Two carbon atoms enter the cycle as an acetyl unit and two carbon atoms 
leave the cycle in the form of two molecules of carbon dioxide. Three hy- 
dride ions (hence, six electrons) are transferred to three molecules of nicoti- 
namide adenine dinucleotide (NAD^), whereas one pair of hydrogen atoms 
(hence, two electrons) is transferred to one molecule of flavin adenine din¬ 
ucleotide (FAD). The function of the citric acid cycle is the harvesting ofhigh- 
energy electrons from carbon fueis. Note that the citric acid cycle itself nei- 
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ther generates a large amount of ATP nor includes oxygen as a reactant 
(Figure 17.3). Instead, the citric acid cycle removes electrons from acetyl 
Co A and uses these electrons to form NADH and FADH 2 . In oxidative 
phosphorylation (Chapter 18), electrons released in the reoxidation of 
NADH and FADH 2 flow through a series of membrane proteins (referred 
to as the electron-transport chain) to generate a proton gradient across the 
membrane. These protons then flow through ATP synthase to generate 
ATP from ADP and inorganic phosphate. Oxygen is required for the cit¬ 
ric acid cycle indirectly inasmuch as it is the electron acceptor at the end of 
the electron-transport chain, necessary to regenerate NAD^ and FAD. 

The citric acid cycle, in conjunction with oxidative phosphorylation, 
provides the vast majority of energy used by aerobic cells—in human be- 
ings, greater than 95%. It is highly efficient because a limited number of 
molecules can generate large amounts of NADH and FADH 2 . Note in 
Figure 17.2 that the four-carbon molecule, oxaloacetate, that initiates the 
first step in the citric acid cycle is regenerated at the end of one passage 
through the cycle. The oxaloacetate acts catalytically: it participates in the 
oxidation of the acetyl group but is itself regenerated. Thus, one molecule 
of oxaloacetate is capable of participating in the oxidation of many acetyl 
molecules. 


17.1 THE CITRIC ACID CYCLE OXIDIZES 
TWO-CARBON UNITS 


Acetyl CoA is the fuel for the citric acid cycle. This important molecule is 
formed from the breakdown of glycogen (the storage form of glucose), fats, 
and many amino acids. Indeed, as we will see in Chapter 22, fats contain 
strings of reduced two-carbon units that are first oxidized to acetyl CoA 
and then completely oxidized to CO 2 by the citric acid cycle. 

17.1.1 The Formation of Acetyl Coenzyme A from Pyruvate 

The formation of acetyl CoA from carbohydrates is less direct than from 
fat. Recall that carbohydrates, most notably glucose, are processed by gly- 
colysis into pyruvate (Chapter 16). Under anaerobic conditions, the pyru¬ 
vate is converted into lactic acid or ethanol, depending on the organism. 
Under aerobic conditions, the pyruvate is transported into mitochondria in 
exchange for OH~ by the pyruvate carrier, an antiporter (Section 13.4). In 
the mitochondrial matrix, pyruvate is oxidatively decarboxylated by the 
pyruvate dehydrogenase complex to form acetyl CoA. 

Pyruvate + CoA + NAD^-> acetyl CoA + CO 2 + NADH 

This irreversible reaction is the link between glycolysis and the citric acid cy¬ 
cle. (Figure 17.4) Note that, in the preparation of the glucose derivative 
pyruvate for the citric acid cycle, an oxidative decarboxylation takes place 
and high-transfer-potential electrons in the form of NADH are captured. 
Thus, the pyruvate dehydrogenase reaction has many of the key features of 
the reactions of the citric acid cycle itself. 

The pyruvate dehydrogenase complex is a large, highly integrated com¬ 
plex of three kinds of enzymes (Table 17.1). Pyruvate dehydrogenase is a 
member of a family of homologous complexes that includes the citric acid 
cycle enzyme a-ketoglutarate dehydrogenase (Section 17.1.6), a branched- 
chain a-ketoacid dehydrogenase, and acetoin dehydrogenase, found in cer- 
tain prokaryotes. These complexes are giant, with molecular masses rang- 
ing from 4 to 10 million daltons (Figure 17.5). As we will see, their elaborate 
structures allow groups to travei from one active site to another, connected 
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FIGURE 17.4 The link between 
glycolysis and the citric acid cycle. 

Pyruvate produced by glycolysis is 
converted into acetyl CoA, the fuel of 
the citric acid cycle. 



FIGURE 17.5 Electron micrograph of 
the pyruvate dehydrogenase complex 
from E. COÍL [Courtesy of Dr. Lester Reed.] 
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TABLE 17.1 Pyruvate dehydrogenase complex of E. coli 


Enzyme 

Abbreviation 

Number 
of chains 

Prosthetic 

group 

Reaction catalyzed 

Pyruvate dehydrogenase 
component 

El 

24 

TPP 

Oxidative decarboxylation 
of pyruvate 

Dihydrolipoyl transacetylase 

E2 

24 

Lipoamide 

Transfer of the acetyl 
group to GoA 

Dihydrolipoyl dehydrogenase 

E3 

12 

FAD 

Regeneration of the 
oxidized form of 
lipoamide 


by tethers to the core of the structure. The mechanism of the pyruvate de¬ 
hydrogenase reaction is wonderfully complex, more so than is suggested by 
its relatively simple stoichiometry. The reaction requires the participation 
of the three enzymes of the pyruvate dehydrogenase complex, each com- 
posed of several polypeptide chains, and five coenzymes: thiamine py- 
rophosphate (TPP), lipoic acid, and FAD serve as catalytic cofactors, and 
Co A and NAD^ are stoichiometric cofactors. 



Thiamine pyrophosphate (TPP) 



O 

Lipoic acid 


At least two additional enzymes regulate the activity of the complex. 

The conversion of pyruvate into acetyl CoA consists of three steps: de- 
carboxylation, oxidation, and transfer of the resultant acetyl group to CoA. 




CoA 


Transfer to CoA 


-CoA 


Acetyl CoA 


These steps must be coupled to preserve the free energy derived from the 
decarboxylation step to drive the formation of NADH and acetyl CoA. 
First, pyruvate combines with TPP and is then decarboxylated (Figure 
17.6). This reaction is catalyzed by the pyruvate dehydrogenase component 
(El) of the multienzyme complex. A key feature of TPP, the prosthetic group 
of the pyruvate dehydrogenase component, is that the carbon atom between 
the nitrogen and sulfur atoms in the thiazole ring is much more acidic than 
most =CH— groups, with a value near 10. This center ionizes to form 
a carbanion, which readily adds to the carbonyl group of pyruvate. 
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FIGURE 17.6 Mechanísm of the decarboxylatíon reactíon of E|, the pyruvate 
dehydrogenase component of the pyruvate dehydrogenese complex. 


This addition is followed by the decarboxylatíon of pyruvate. The positively 
charged ring of TPP acts as an electron sink that stabilizes the negative 
charge that is transferred to the ring as part of the decarboxylatíon. Proto- 
nation yields hydroxyethyl-TPP. 

Second, the hydroxyethyl group attached to TPP is oxidized to form an 
acetyl group and concomitantly transferred to lipoamide, a derivative of 
lipoic acid that is linked to the side chain of a lysine residue by an amide 
linkage. 


R' 



(ionized form) 



The oxidant in this reactíon is the disulfide group of lipoamide, which is 
reduced to its disulfhydryl form. This reactíon, also catalyzed by the pyru¬ 
vate dehydrogenase component Ei, yields acetyllipoamide. 

Third, the acetyl group is transferred from acetyllipoamide to CoA to form 
acetyl CoA. 



Reactive disulfide bond 


Lipoamide 




Acetyl CoA Dihydrolipoamide 


Dihydrolipoyl transacetylase (E 2 ) catalyzes this reactíon. The energy-rich 
thioester bond is preserved as the acetyl group is transferred to CoA. Re- 
call that CoA serves as a carrier of many activated acyl groups, of which 
acetyl is the simplest (Section 14.3.1). Acetyl CoA, the fuel for the citric 
acid cycle, has now been generated from pyruvate. 

The pyruvate dehydrogenase complex cannot complete another catalytic 
cycle until the dihydrolipoamide is oxidized to lipoamide. In a fourth step, 
the oxidized form of lipoamide is regenerated by dihydrolipoyl dehydrogenase 
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FIGURE 17.7 Schematic representatíon 
of the pyruvate dehydrogenase 
complex. The transacetylase core (E 2 ) is 
shown in red, the pyruvate dehydrogenase 
component (Ei) in yellow, and the 
dihydrolipoyl dehydrogenase (E3) in green. 


FIGURE 17.8 Structure of the 
transacetylase (E 2 ) core. Each red bali 
represents a trimer of three E 2 subunits. 
Each subunit consists of three domains: a 
lipoamide-binding domain, a small 
domain for interaction with E 3 , and a large 
transacetylase catalytic domain. All three 
subunits of the transacetylase domain are 
shown in the ribbon representation, with 
one depicted in red. 


(E 3 ). Two electrons are transferred to an FAD prosthetic group of the en- 
zyme and then to NAD^. 



Dihydrolipoamide Lipoamide 


This electron transfer to FAD is unusual, because the common role for FAD 
is to receive electrons from NADH. The electron transfer potential of FAD 
is altered by its association with the enzyme and enables it to transfer elec¬ 
trons to NAD^. Proteins tightly associated with FAD or flavin mononu- 
cleotide (FMN) are cslled flavoproteins. 


17.1.2 Flexible Linkages Allow Lipoamide to Move 
Between Different Active Sites 

Although the structure of an intact member of the pyruvate dehydrogenase 
complex family has not yet been determined in atomic detail, the structures 
of all of the component enzymes are now known, albeit from different com- 
plexes and species. Thus, it is now possible to construct an atomic model of 
the complex to understand its activity (Figure 17.7). 

The core of the complex is formed by E 2 . Acetyltransferase consists of 
eight catalytic trimers assembled to form a hollow cube. Each of the three 
subunits forming a trimer has three major domains (Figure 17.8). At the 
amino terminus is a small domain that contains a bound lipoamide cofac- 
tor attached to a lysine residue. This domain is homologous to biotin-bind- 
ing domains such as that of pyruvate carboxylase (see Figure 16.26). The 
lipoamide domain is followed by a small domain that interacts with E 3 
within the complex. A larger transacetylase domain completes an E 2 sub¬ 
unit. El is an a 2 ^2 tetramer, and E 3 is a a|3 dimer. Twenty-four copies of 



Lipoamide 
domain i 


Lipoamide 


Domain 
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E 3 component 


Transacetylase 
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FIGURE 17.9 Reactions of the pyruvate dehydrogenase complex. At the top (center), 
the enzyme (represented by a yellow, a blue, and two red spheres) is unmodified and 
ready for a catalytic cycle. (1) Pyruvate is decarboxylated to form the hydroxyethyl TPP. 

(2) The dihydrolipoyl arm of E 2 moves into the active site of E]. (3) Ei catalyzes the transfer 
of the two-carbon group to the dihydrolipoyl group to form the acetyl-lipoyl complex. 

(4) E 2 catalyzes the transfer of the acetyl moiety to CoA to form the product acetyl CoA. 

The disulfhydryl lipoyl arm then swings to the active site of E 3 . E 3 catalyzes (5) the 
reduction of the lipoic acid and ( 6 ) the transfer of the protons and electrons to NAD^ to 
complete the reaction cycle. 


El and 12 copies of E 3 surround the E 2 core. How do the three distinct ac¬ 
tive sites Work in concert (Figure 17.9)? 

1 . Pyruvate is decarboxylated at the active site of Ei, forming the substi- 
tuted TPP intermediate, and CO 2 leaves as the first product. This active 
site lies within the Ei complex, connected to the enzyme surface by a 
20 -A-long hydrophobic channel. 

2 . E 2 inserts the lipoyl-lysine arm of the lipoamide domain into the chan¬ 
nel in El. 

3. El catalyzes the transfer of the acetyl group to the lipoamide. The acety- 
lated lipoyl-lysine arm then leaves Ei and enters the E 2 cube through 30 A 
Windows on the sides of the cube to visit the active site of E 2 , located deep 
in the cube at the subunit interface. 

4. The acetyl moiety is then transferred to CoA, and the second product, 
acetyl CoA, leaves the cube. The reduced lipoyl-lysine arm then swings to 
the active site of the E 3 flavoprotein. 

5. At the E 3 active site, the lipoamide acid is oxidized by coenzyme FAD. 

6 . The final product, NADH, is produced with the reoxidation of FADH 2 , 
and the reactivated lipoamide is ready to begin another reaction cycle. 
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other increases the overall reaction rate and minimizes side reactions. All the 
intermediates in the oxidative decarboxylation of pyruvate are tightly bound 
to the complex and are readily transferred because of the ability of the lipoyl- 
lysine arm of E 2 to call on each active site in turn. 

17.1.3 Citrate Synthase Forms Citrate from Oxaloacetate and 
Acetyl Coenzyme A 

The citric acid cycle begins with the condensation of a four-carbon unit, ox¬ 
aloacetate, and a two-carbon unit, the acetyl group of acetyl CoA. Ox¬ 
aloacetate reacts with acetyl CoA and H 2 O to yield citrate and CoA. 
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FIGURE 17.10 Conformatíonal 
changes in citrate synthase on bínding 
oxaloacetate. The small domain of 
each subunit of the homodimer is shown 
in yellow; the large domain is shown in 
blue. (Left) Open form of enzyme alone. 
(Right) Closed form of the liganded enzyme. 


This reaction, which is an aldol condensation followed by a hydrolysis, is 
catalyzed by citrate synthase. Oxaloacetate first condenses with acetyl CoA 
to form citryl CoA, which is then hydrolyzed to citrate and CoA. The hy¬ 
drolysis of citryl CoA, a high-energy thioester intermediate, drives the 
overall reaction far in the direction of the synthesis of citrate. In essence, 
the hydrolysis of the thioester powers the synthesis of a new molecule from 
two precursors. Because this reaction initiates the cycle, it is very impor- 
tant that side reactions be minimized. Let us briefly consider the how cit¬ 
rate synthase prevents wasteful processes such as the hydrolysis of acetyl 
CoA. 

Mammalian citrate synthase is a dimer of identical 49-kd subunits. Each 
active site is located in a cleft between the large and small domains of a sub¬ 
unit, adjacent to the subunit interface. The results of x-ray crystallographic 
studies of citrate synthase and its complexes with several substrates and in- 
hibitors revealed that the enzyme undergoes large conformational changes 
in the course of catalysis. Citrate synthase exhibits sequential, ordered 











kinetics: oxaloacetate binds first, followed by acetyl CoA. The reason for 
the ordered binding is that oxaloacetate induces a major structural rearrange- 
ment leading to the creation of a binding site for acetyl CoA. The open form 
of the enzyme observed in the absence of ligands is converted into a closed 
form by the binding of oxaloacetate (Figure 17.10). In each subunit, the 
small domain rotates 19 degrees relative to the large domain. Movements as 
large as 15 A are produced by the rotation of a helices elicited by quite small 
shifts of side chains around bound oxaloacetate. This conformational transi- 
tion is reminiscent of the cleft closure in hexokinase induced by the bind¬ 
ing of glucose (Section 16.1.1). 

Citrate synthase catalyzes the condensation reaction by bringing the sub- 
strates into dose proximity, orienting them, and polarizing certain bonds. 
Two histidine residues and an aspartate residue are important players 
(Figure 17.11). One of the histidine residues (His 274) donates a proton to 
the carbonyl oxygen of acetyl CoA to promote the removal of a methyl pro¬ 
ton by Asp 375. Oxaloacetate is activated by the transfer of a proton from 
His 320 to its carbonyl carbon atom. The concomitant attack of the enol of 
acetyl CoA on the carbonyl carbon of oxaloacetate results in the formation 
of a carbon-carbon bond. The newly formed citryl CoA induces additional 
structural changes in the enzyme. The active site becomes completely en- 
closed. His 274 participates again as a proton donor to hydrolyze the 
thioester. Coenzyme A leaves the enzyme, followed by citrate, and the en¬ 
zyme returns to the initial open conformation. 

We can now understand how the wasteful hydrolysis of acetyl CoA is 
prevented. Citrate synthase is well suited to hydrolyze citryl CoA but not 
acetyl CoA. How is this discrimination accomplished? First, acetyl CoA 
does not bind to the enzyme until oxaloacetate is bound and ready for con¬ 
densation. Second, the catalytic residues crucial for hydrolysis of the 
thioester linkage are not appropriately positioned until citryl CoA is formed. 
As with hexokinase (Section 16.1.1) and triose phosphate isomerase (Sec¬ 
tion 16.1.4), induced fit prevents an undesirable side reaction. 




17.1.4 Citrate Is Isomerized into Isocitrate 

The tertiary hydroxyl group is not properly located in the citrate molecule for 
the oxidative decarboxylations that follow. Thus, citrate is isomerized into iso¬ 
citrate to enable the six-carbon unit to undergo oxidative decarboxylation. 
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FIGURE 17.11 Mechanísm of synthesis 
of citryl CoA by citrate synthase. The 

condensation of oxaloacetate and acetyl 
CoA proceeds through an enol intermediate. 
The subsequent hydrolysis of citryl CoA 
yields citrate and CoA. 



Citryl CoA complex 
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The isomerization of citrate is accomplished by a dehydration step followed 
by a hydration step. The result is an interchange of a hydrogen atom and a 
hydroxyl group. The enzyme catalyzing both steps is called aconitase be- 
cause cis-aconitate is an intermediate. 
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Aconitase is an iron—sulfur protein, or nonheme iron protein. It contains four 
iron atoms that are not incorporated as part of a heme group. The four iron 
atoms are complexed to four inorganic sulfides and three cysteine sulfur 
atoms, leaving one iron atom available to bind citrate and then isocitrate 
through their carboxylate and hydroxyl groups (Figure 17.12). This iron 
center, in conjunction with other groups on the enzyme, facilitates the de¬ 
hydration and rehydration reactions. We will consider the role of these 
iron-sulfur clusters in the electron-transfer reactions of oxidative phospho- 
rylation subsequently (Section 18.3.1). 


FIGURE 17.12 Bíndíng of citrate to the 
iron-sulfur complex of aconitase. A 

4Fe-4S iron-sulfur cluster is a component 
of the active site of aconitase. One of the 
iron atoms of the cluster binds to the 
carboxylate and hydroxyl groups of citrate. 



Y The iron-sulfur cluster in aconitase is somewhat unstable, so one or 
more iron atoms dissociate under conditions of low iron availability 
in the cell. Remarkably, this sensitivity to iron levei was exploited in the 
evolution of a mechanism for regulating gene expression in response to iron 
leveis, as will be discussed in Chapter 31. 

17.1.5 Isocitrate Is Oxidized and Decarboxylated 
to Oi -Ketogl u ta rate 

We come now to the first of four oxidation-reduction reactions in the cit¬ 
ric acid cycle. The oxidative decarboxylation of isocitrate is catalyzed by 
isocitrate dehydrogenase. 

Isocitrate + NAD^-> a-ketoglutarate + CO 2 + NADH 

The intermediate in this reaction is oxalosuccinate, an unstable |3-ketoacid. 
While bound to the enzyme, it loses CO 2 to form a-ketoglutarate. 
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The rate of formation of a-ketoglutarate is important in determining the 
overall rate of the cycle, as will be discussed in Section 17.2.2. This oxida- 
tion generates the first high-transfer-potential electron carrier NADH in 
the cycle. 

17.1.6 Succinyl Coenzyme A Is Formed by the Oxidative 
Decarboxylation of a-Ketoglutarate 

The conversion of isocitrate into a-ketoglutarate is followed by a second 
oxidative decarboxylation reaction, the formation of succinyl CoA from 
a-ketoglutarate. 

-OOC. .0 CoA—S o 

CH 2 

> I + CO 2 + NADH 

CH 2 

COO- coo- 

a-Ketoglutarate Succinyl CoA 

Y The oxidative decarboxylation of a-ketoglutarate closely resembles 
that of pyruvate, also an a-ketoacid. 

Pyruvate + CoA + NAD^-> acetyl CoA + CO 2 NADH 

Both reactions include the decarboxylation of an a-ketoacid and the subse- 
quent formation of a high-transfer-potential thioester linkage with CoA. 
The complex that catalyzes the oxidative decarboxylation of a-ketoglutarate 
is homologous to the pyruvate dehydrogenase complex, and the reaction 
mechanism is entirely analogous. The a-ketoglutarate dehydrogenase com- 
ponent (E 2 ) and transsuccinylase (Ei) are different from but homologous to 
the corresponding enzymes in the pyruvate dehydrogenase complex, 
whereas the dihydrolipoyl dehydrogenase components (E 3 ) of the two com- 
plexes are identical. 

17.1.7 A High Phosphoryl-Transfer Potential Compound Is 
Generated from Succinyl Coenzyme A 

Succinyl CoA is an energy-rich thioester compound. The 
AG °' for the hydrolysis of succinyl CoA is about — 8 kcal 
moD^ ( — 33.5 kj moD^), which is comparable to that of 
ATP ( — 7.3 kcal moP\ or —30.5 kJ moP^). In the ci- 
trate synthase reaction, the cleavage of the thioester bond 
powers the synthesis of the six-carbon citrate from the 
four-carbon oxaloacetate and the two-carbon fragment. 

The cleavage of the thioester bond of succinyl CoA is cou- 
pled to the phosphorylation of a purine nucleoside diphosphate, usually GDP. 
This reaction is catalyzed by succinyl CoA synthetase (succinate thiokinase). 
This enzyme is an a 2|32 heterodimer; the functional unit is one a|3 pair. The 
mechanism is a clear example of energy transformations: energy inherent in 
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FIGURE 17.13 Reaction mechanísm 
of succínyl CoA synthetase. The 

formation of GTP at the expense of 
succinyl CoA is an example of substrate- 
level phosphorylation. The reaction 
proceeds through a phosphorylated 
enzyme intermediate. 


the thioester molecule is transformed into phosphoryl-group transfer po- 
tential (Figure 17.13). The first step is the displacement of coenzyme A by 
orthophosphate, which generates another energy-rich compound, succinyl 
phosphate. A histidine residue of the a subunit removes the phosphoryl 
group with the concomitant generation of succinate and phosphohistidine. 
The phosphohistidine residue then swings over to a bound nucleoside 
diphosphate and the phosphoryl group is transferred to form the nucleoside 
triphosphate. The participation of high-energy compounds in all the steps 
is attested to by the fact that the reaction is readily reversible: AG°' = — 0.8 
kcal moP^ ( —3.4 kj moP^). This is the only step in the citric acid cycle that 
directly yields a compound with high phosphoryl transfer potential through 
a substrate-levei phosphorylation. Some mammalian succinyl CoA syn- 
thetases are specific for GDP and others for ADR The E. coli enzyme uses 
either GDP or ADP as the phosphoryl-group acceptor. We have already 
seen that GTP is an important component of signal-transduction systems 
(Chapter 15). Alternatively, its y-phosphoryl group can be readily trans¬ 
ferred to ADP to form ATP, in a reaction catalyzed by nucleoside diphos- 
phokinase. 

GTP + ADP GDP + ATP 

r The mechanism of succinyl CoA synthetase reveals that a phospho¬ 
ryl group is transferred first to succinyl CoA bound in the a subunit 
and then to a nucleoside diphosphate bound in the |3 subunit. Examination 
of the three-dimensional structure of succinyl CoA synthetase shows that 
each subunit comprises two domains (Figure 17.14). The carboxyl-terminal 
domains of the two subunits are similar to one another, whereas the amino- 
terminal domains have different structures, each characteristic of its role in 
the mechanism. The amino-terminal domain of the a subunit forms a Ross- 
mann fold (Section 16.1.10), which binds the ADP component of succinyl 
CoA, whereas the amino-terminal domain of the (3 subunit is an ATP-grasp 
domain, a nucleotide-activating domain found in many enzymes, especially 
those catalyzing purine biosynthesis (Section 16.3.2 and Chapter 25). Suc¬ 
cinyl CoA synthetase has evolved by adopting these domains and harness- 
ing them to allow the capture of the energy associated with succinyl CoA 
cleavage to drive the generation of a nucleoside triphosphate. 
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FIGURE 17.14 Structure of 
succínyl CoA synthetase. The enzyme is 
composed of two subunits. The a subunit 
contains a Rossmann fold that binds the 
ADP component of CoA, and the p 
subunit contains a nucleotide-activating 
region called the ATP-grasp domain. The 
ATP-grasp domain is shown here binding a 
molecule of ADP. The histidine residue picks 
up the phosphoryl group from near the 
CoA and swings over to transfer it to the 
nucleotide bound in the ATP-grasp domain. 


17.1.8 Oxaloacetate Is Regenerated by the Oxidation 
of Succinate 

Reactions of four-carbon compounds constitute the final stage of the citric 
acid cycle: the regeneration of oxaloacetate. 


COO" FAD 

FADH2 

XOO" 

H20 

COO" 

1 -C-H V 



V. 

HO -C-H 

1- C-H 

-OOC^ 

1 


H- C-H 

COO" 




COO" 

Succinate 

Fumarate 


Malate 


NAD+ 


NADH + H+ 




H- 


xoo- 


-H 


coo- 

Oxaloacetate 


The reactions constitute a metabolic motif that we will see again in fatty 
acid synthesis and degradation as well as in the degradation of some amino 
acids (see Figure 14.17). A methylene group (CH 2 ) is converted into a car- 
bonyl group (C = O) in three steps: an oxidation, a hydration, and a sec- 
ond oxidation reaction. Not only is oxaloacetate thereby regenerated for an- 
other round of the cycle, but also more energy is extracted in the form of 
FADH 2 and NADH. 

Succinate is oxidized to fumarate by succinate dehydrogenase. The hy- 
drogen acceptor is FAD rather than NAD^, which is used in the other three 
oxidation reactions in the cycle. In succinate dehydrogenase, the isoallox- 
azine ring of FAD is covalently attached to a histidine side chain of the en¬ 
zyme (denoted E-FAD). 

E-FAD + succinate ^ E-FADH 2 + fumarate 

FAD is the hydrogen acceptor in this reaction because the free-energy 
change is insufficient to reduce NAD^. FAD is nearly always the electron 
acceptor in oxidations that remove two hydrogen atoms from a substrate. 

Succinate dehydrogenase, like aconitase, is an iron-sulfur protein. Indeed, 
succinate dehydrogenase contains three different kinds of iron-sulfur clus- 
ters, 2Fe-2S (two iron atoms bonded to two inorganic sulfides), 3Fe-4S, and 
4Fe-4S. Succinate dehydrogenase— ^which consists of two subunits, one 70 kd 
and the other 27 kd—differs from other enzymes in the citric acid cycle in be- 
ing embedded in the inner mitochondrial membrane. In fact, succinate dehy¬ 
drogenase is directly associated with the electron-transport chain, the link be- 
tween the citric acid cycle and ATP formation. FADH 2 produced by the 
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oxidation of succinate does not dissociate from the enzyme, in contrast with 
NADH produced in other oxidation-reduction reactions. Rather, two elec- 
trons are transferred from FADH 2 directly to iron-sulfur clusters of the en¬ 
zyme. The ultimate acceptor of these electrons is molecular oxygen, as we 
shall see in Chapter 18. 

The next step is the hydration of fumarate to form L-malate. Fumarase 
catalyzes a stereospecific trans addition of a hydrogen atom and a hydroxyl 
group. The hydroxyl group adds to only one side of the double bond of fu¬ 
marate; hence, only the L isomer of malate is formed. 

Finally, malate is oxidized to form oxaloacetate. This reaction is cat- 
alyzed by malate dehydrogenase, and NAD^ is again the hydrogen acceptor. 

Malate + NAD^ v oxaloacetate + NADH + 

Note that the standard free energy for this reaction, unlike that for the other 
steps in the citric acid cycle, is significantly positive. The oxidation of malate 
is driven by the utilization of the products—oxaloacetate by citrate synthase 
and NADH by the electron-transport chain. 

17.1.9 Stoichiometry of the Citric Acid Cycle 

The net reaction of the citric acid cycle is: 

Acetyl CoA + 3 NAD+ + FAD + GDP + Pi + 2 H 2 O-> 

2 CO 2 + 3 NADH + FADH 2 + GTP + 2 H+ + CoA 
Let us recapitulate the reactions that give this stoichiometry (Figure 17.15 
and Table 17.2): 


NADH 


NAD+ 



coo- 


coo- 

I 

HO—C—H 

I 

CH 2 

I 

coo- 


-00c- 


Citrate 


NAD+ 



NADH + H+ 
+ CO2 


FIGURE 17.15 The citric acid cycle. 
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TABLE 17.2 Citric acid cycle 


Step 

Reaction 

Enzyme 

Prosthetic 

group 

Type^ 

AG' 

0 f 

kcal mol~^ 

kj mol”^ 

1 

Acetyl CoA + oxaloacetate + H 2 O-> 

Citrate synthase 


a 

-7.5 

-31.4 


citrate + CoA + H"^ 






2a 

Citrate ^ ds-aconitate + H 2 O 

Aconitase 

Fe-S 

b 

+ 2.0 

+ 8.4 

2b 

ds-Aconitate + H 2 O ^ isocitrate 

Aconitase 

Fe-S 

c 

-0.5 

-2.1 

3 

Isocitrate + NAD ^ 

Isocitrate 


d + e 

-2.0 

-8.4 


a-ketoglutarate + CO 2 + NADH 

dehydrogenase 





4 

a-Ketoglutarate + NAD + CoA v 

a-Ketoglutarate 

Lipoic acid, 

d + e 

-7.2 

-30.1 


succinyl CoA + CO 2 + NADH 

dehydrogenase 

FAD, TPP 






complex 





5 

Succinyl CoA + Pi + GDP ^ 

Succinyl CoA 


f 

-0.8 

-3.3 


succinate + GTP + GoA 

synthetase 





6 

Succinate + FAD (enzyme-bound) ^ 

Succinate 

FAD, Fe-S 

e 

^0 

0 


fumarate + FADH 2 (enzyme-bound) 

dehydrogenase 





7 

Fumarate + H 2 O ^ L- malate 

Fumarase 


c 

-0.9 

-3.8 

8 

L-Malate + NAD"^ t" 

Malate 


e 

+ 7.1 

+ 29.7 


oxaloacetate + NADH + H"^ 

dehydrogenase 






^Reaction type: (a) condensation; (b) dehydration; (c) hydration; (d) decarboxylation; 
(e) oxidation; (f) substrate-level phosphorylation. 


1 . Two carbon atoms enter the cycle in the condensation of an acetyl unit 
(from acetyl CoA) with oxaloacetate. Two carbon atoms leave the cycle in 
the form of CO 2 in the successive decarboxylations catalyzed by isocitrate 
dehydrogenase and a-ketoglutarate dehydrogenase. Interestingly, the re- 
sults of isotope-labeling studies revealed that the two carbon atoms that en¬ 
ter each cycle are not the ones that leave. 

2. Four pairs of hydrogen atoms leave the cycle in four oxidation reactions. 
Two molecules of NAD^ are reduced in the oxidative decarboxylations of 
isocitrate and n-ketoglutarate, one molecule of FAD is reduced in the oxi¬ 
dation of succinate, and one molecule of NAD^ is reduced in the oxidation 
of malate. 

3. One compound with high phosphoryl transfer potential, usually GTP, 
is generated from the cleavage of the thioester linkage in succinyl CoA. 

4. Two molecules of water are consumed: one in the synthesis of citrate by 
the hydrolysis of citryl CoA and the other in the hydration of fumarate. 

Recall also that NADH is generated in the formation of acetyl CoA from 
pyruvate by the pyruvate dehydrogenase reaction. 

The efficiency of the citric acid cycle may be enhanced by the arrange- 
ment of the constituent enzymes. Evidence is accumulating that the en- 
zymes are physically associated with one another to facilitate substrate chan- 
neling between active sites. The word metabolon has been suggested as the 
name for such multienzyme complexes. 

As will be discussed in Chapter 18, the electron-transport chain oxidizes 
the NADH and FADH 2 formed in the citric acid cycle. The transfer of elec- 
trons from these carriers to O 2 , the ultimate electron acceptor, leads to the 
generation of a proton gradient across the inner mitochondrial membrane. 
This proton-motive force then powers the generation of ATP; the net 
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FIGURE 17.16 From glucose to acetyl 
CoA. The synthesis of acetyl CoA by the 
pyruvate dehydrogenase complex is a key 
irreversible step in the metabolism of 
glucose. 


FIGURE 17.17 Regulatíon of the 
pyruvate dehydrogenase complex. 

The complex is inhibited by its immediate 
Products, NADH and acetyl CoA. The 
pyruvate dehydrogenase component is 
aiso regulated by covalent modification. 

A specific kinase phosphorylates and 
inactivates pyruvate dehydrogenase, and 
a phosphatase actives the dehydrogenase 
by removing the phosphoryl. The kinase 
and the phosphatase aIso are highly 
regulated enzymes. 


stoichiometry is about 2.5 ATP per NADH, and 1.5 ATP per FADH 2 . 
Consequently, 9 high-transfer-potential phosphoryl groups are generated 
when the electron-transport chain oxidizes 3 molecules of NADH and 1 
molecule of FADH 2 , and 1 high-transfer-potential phosphoryl group per 
acetyl unit is directly formed in the citric acid cycle. Thus, 1 acetate unit 
generates approximately 10 molecules of ATP. In dramatic contrast, only 2 
molecules of ATP are generated per molecule of glucose (which generates 
2 molecules of acetyl CoA) by anaerobic glycolysis. 

Recall that molecular oxygen does not participate directly in the citric 
acid cycle. However, the cycle operates only under aerobic conditions be- 
cause NAD^ and FAD can be regenerated in the mitochondrion only by 
the transfer of electrons to molecular oxygen. Glycolysis has both an aerobic 
and an anaerobic mode, whereas the citric acid cycle is strictly aerobic. Gly¬ 
colysis can proceed under anaerobic conditions because NAD^ is regener¬ 
ated in the conversion of pyruvate into lactate. 


17.2 ENTRY TO THE CITRIC ACID CYCLE AND 
METABOLISIVI THROUGH IT ARE CONTROLLED 


The citric acid cycle is the final common pathway for the aerobic oxidation 
of fuel molecules. Moreover, as we will see shortly (Section 17.3) and re- 
peatedly elsewhere in our study of biochemistry, the cycle is an important 
source of building blocks for a host of important biomolecules. As befits its 
role as the metabolic hub of the cell, entry into the cycle and the rate of the 
cycle itself are controlled at several stages. 

17.2.1 The Pyruvate Dehydrogenase Complex Is Regulated 
Allosterically and by Reversible Phosphorylation 

As we saw earlier, glucose can be formed from pyruvate (Section 16.3). 
However, the formation of acetyl CoA from pyruvate is an irreversible step in 
animais and thus they are unable to convert acetyl CoA back into glucose. The 
oxidative decarboxylation of pyruvate to acetyl CoA commits the carbon 
atoms of glucose to two principal fates: oxidation to CO 2 by the citric acid 
cycle, with the concomitant generation of energy, or incorporation into lipid 
(Figure 17.16). As expected of an enzyme at a criticai branch point in me¬ 
tabolism, the activity of the pyruvate dehydrogenase complex is stringently 
controlled by several means (Figure 17.17). High concentrations of reaction 
Products of the complex inhibit the reaction: acetyl CoA inhibits the 
transacetylase component (E 2 ), whereas NADH inhibits the dihydrolipoyl 
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covalent modification of the pyruvate dehydrogenase component. Phos- Control of the Citric Add Cyde 

phorylation of the pyruvate dehydrogenase component (Ei) by a specific kinase 

switches off the activity of the complex. Deactivation is reversed by the action 

of a specific phosphatase. The site of phosphorylation is the transacetylase 

component (E 2 ), again highlighting the structural and mechanistic impor- 

tance of this core. Increasing the NADH/NAD^, acetyl CoA/CoA, or 

ATP/ADP ratio promotes phosphorylation and, hence, deactivation of the 

complex. In other words, high concentrations of immediate (acetyl Co A and 

NADH) and ultimate (ATP) products inhibit the activity. Thus, pyruvate 

dehydrogenase is switched off when the energy charge is high and biosynthetic 

intermediates are abundant. On the other hand, pyruvate as well as ADP (a 

signal of low energy charge) activate the dehydrogenase by inhibiting the 

kinase. 

In contrast, ai-adrenergic agonists and hormones such as vasopressin 
stimulate pyruvate dehydrogenase by triggering a rise in the cytosolic Ca^^ 
levei (Section 15.3.4), which in turn elevates the mitochondrial Ca^^ levei. 

The rise in mitochondrial Ca^^ activates the pyruvate dehydrogenase com¬ 
plex by stimulating the phosphatase. Insulin also accelerates the conversion 
of pyruvate into acetyl CoA by stimulating the dephosphorylation of the 
complex. In turn, glucose is funneled into pyruvate. 


7 The importance of this covalent control is illustrated in people with 
a phosphatase deficiency. Because pyruvate dehydrogenase is always 
phosphorylated and thus inactive, glucose is processed to lactic acid. This 
condition results in unremitting lactic acidosis (high blood leveis of lactic 
acid), which leads to the malfunctioning of many tissues, most notably the 
central nervous system (Section 17.3.2). 


17.2.2 The Citric Acid Cycle Is Controlled at Several Points 

The rate of the citric acid cycle is precisely adjusted to meet an animal 
cells needs for ATP (Figure 17.18). The primary control points are the al- 
losteric enzymes isocitrate dehydrogenase and a-ketoglutarate dehydrogenase. 

Isocitrate dehydrogenase is allosterically stimulated by ADP, which en- 
hances the enzyme's affinity for substrates. The binding of isocitrate, NAD^, 
Mg^^, and ADP is mutually cooperative. In contrast, NADH inhibits iso¬ 
citrate dehydrogenase by directly displacing NAD^. ATP, too, is inhibitory. 
It is important to note that several steps in the cycle require NAD^ or FAD, 
which are abundant only when the energy charge is low. 

A second control site in the citric acid cycle is a-ketoglutarate dehydro¬ 
genase. Some aspects of this enzyme^s control are like those of the pyru¬ 
vate dehydrogenase complex, as might be expected from the homology 
of the two enzymes. a-Ketoglutarate dehydrogenase is inhibited by suc- 
cinyl CoA and NADH, the products of the reaction that it catalyzes. In 
addition, a-ketoglutarate dehydrogenase is inhibited by a high energy 
charge. Thus, the rate of the cycle is reduced when the cell has a high levei 
of ATP. 

In many bactéria, the funneling of two-carbon fragments into the 
cycle also is controlled. The synthesis of citratefrom oxaloacetate and acetyl 
CoA carbon units is an important control point in these organisms. ATP 
is an allosteric inhibitor of citrate synthase. The effect of ATP is to in- 
crease the value of Km for acetyl CoA. Thus, as the levei of ATP increases, 
less of this enzyme is saturated with acetyl CoA and so less citrate is 
formed. 
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FIGURE 17.18 Control of the citric acid 
cycle. The citric acid cycle is regulated 
primarily by the concentration of ATP 
and NADH. The key control points are 
the enzymes isocitrate dehydrogenase 
and a-ketoglutarate dehydrogenase. 
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17.3 THE CITRIC ACID CYCLE IS A SOURCE 
OF BIOSYNTHETIC PRECURSORS 


FIGURE 17.19 Biosynthetic roles of 
the citric acid cycle. Intermediates drawn 
off for biosyntheses (shown by red arrows) 
are replenished by the formation of 
oxaloacetate from pyruvate. 


Thus far, discussion has focused on the citric acid cycle as the major degrada- 
tive pathway for the generation of ATP. As a major metabolic hub of the cell, 
the citric acid cycle also provides intermediates for biosyntheses (Figure 17.19). 
For example, most of the carbon atoms in porphyrins come from succinyl 
CoA. Many of the amino acids are derived from a-ketoglutarate and oxa¬ 
loacetate. These biosynthetic processes will be discussed in subsequent 
chapters. 
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17.3.1 The Citric Acid Cycle Must Be Capable of Being Rapidiy 
Replenished 

The important point now is that citric acid cycle intermediates must be re¬ 
plenished if any are drawn off for biosyntheses. Suppose that much oxaloac¬ 
etate is converted into amino acids for protein synthesis and, subsequently, 
the energy needs of the cell rise. The citric acid cycle will operate to a re- 
duced extent unless new oxaloacetate is formed, because acetyl CoA can- 
not enter the cycle unless it condenses with oxaloacetate. Even though ox¬ 
aloacetate is recycled, a minimal levei must be maintained to allow the cycle 
to function. 

How is oxaloacetate replenished? Mammals lack the enzymes for the net 
conversion of acetyl CoA into oxaloacetate or any other citric acid cycle 
intermediate. Rather, oxaloacetate is formed by the carboxylation of pyru¬ 
vate, in a reaction catalyzed by the biotin-dependent enzyme pyruvate 
carboxylase. 


Pyruvate + CO 2 + ATP + H 2 O-> oxaloacetate + ADP + P^ + 2 


Recall that this enzyme plays a crucial role in gluconeogenesis (Section 
16.3.2). It is active only in the presence of acetyl CoA, which signifies the 
need for more oxaloacetate. If the energy charge is high, oxaloacetate is con¬ 
verted into glucose. If the energy charge is low, oxaloacetate replenishes the 
citric acid cycle. The synthesis of oxaloacetate by the carboxylation of pyru¬ 
vate is an example of an anaplerotic reaction (of Greek origin, meaning to 
‘Till up''), a reaction that leads to the net synthesis, or replenishment, of 
pathway components. Note that, because the citric acid cycle is a cycle, it 
can be replenished by the generation of any of the intermediates. 






17.3.2 The Disruption of Pyruvate Metabolism Is the Cause 
of Beriberi and Poisoning by Mercury and Arsenic 

T Beriberi, a neurologic and cardiovascular disorder, is caused by a 
dietary deficiency of thiamine (also called vitamin Bi). The disease 
has been and continues to be a serious Health problem in the Far East be- 
cause rice, the major food, has a rather low content of thiamine. This defi¬ 
ciency is partly ameliorated if the whole rice grain is soaked in water before 
milling—some of the thiamine in the husk then leaches into the rice kernel. 
The problem is exacerbated if the rice is polished, because only the outer 
layer contains significant amounts of thiamine. Beriberi is also occasionally 
seen in alcoholics who are severely malnourished and thus thiamine defi- 
cient. The disease is characterized by neurologic and cardiac symptoms. 
Damage to the peripheral nervous system is expressed as pain in the limbs, 
weakness of the musculature, and distorted skin sensation. The heart may 
be enlarged and the cardiac output inadequate. 

Which biochemical processes might be affected by a deficiency of thi¬ 
amine? Thiamine pyrophosphate is the prosthetic group of three important 
enzymes: pyruvate dehydrogenase, a-ketoglutarate dehydrogenase, and trans- 
ketolase. Transketolase functions in the pentose phosphate pathway, which 
will be discussed in Chapter 20. The common feature of enzymatic reactions 
utilizing TPP is the transfer of an activated aldehyde unit. In beriberi, the 
leveis of pyruvate and a-ketoglutarate in the blood are higher than nor¬ 
mal. The increase in the levei of pyruvate in the blood is especially pro- 
nounced after the ingestion of glucose. A related finding is that the activ- 
ities of the pyruvate and a-ketoglutarate dehydrogenase complexes in 
vivo are abnormally low. The low transketolase activity of red cells in 
beriberi is an easily measured and reliable diagnostic indicator of the 
disease. 

Why does TPP deficiency lead primarily to neurological disorders? The 
nervous system relies essentially on glucose as its only fuel. In contrast, most 
other tissues can use fats as a source of fuel for the citric acid cycle. The 
product of aerobic glycolysis, pyruvate, can enter the citric acid cycle only 
through the pyruvate dehydrogenase complex. 

Symptoms similar to those of beriberi arise if an organism is exposed to 
mercury or arsenite (AsOa^”). Both elements have a high affinity for neigh- 
boring sulfhydryls, such as those in the reduced dihydrolipoyl groups of the 
dihydrolipoyl dehydrogenase component of the pyruvate dehydrogenase 
complex (Figure 17.20). The binding of mercury or arsenite to the dihydro¬ 
lipoyl groups inhibits the complex and leads to central nervous system 
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Beriberi— 

A vitamin-deficiency disease first 
described in 1630 by Jacob Bonitus, 
a Dutch physician working in Java: 

"A certain very troublesome affliction, 
which attacks men, is called by the 
inhabitants Beriberi (which means 
sheep). I believe those, whom this 
same disease attacks, with their knees 
shaking and the legs raised up, walk 
like sheep. It is a kind of paralysis, or 
rather Tremor: for it penetrates the 
motion and sensation of the hands 
and feet indeed sometimes of the 
whole body." 


FIGURE 17.20 Arsenite poisoning. 

Arsenite inhibits the pyruvate 
dehydrogenase complex by inactivating 
the dihydrolipoamide component of the 
transacetylase. Some sulfhydryl reagents, 
such as 2,3-dimercaptoethanol, relieve the 
inhibition by forming a complex with the 
arsenite that can be excreted. 
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[The Granger Collection.] 


The manuscript proposing the citric 
acid cycle was submitted for 
publication to Nature but was rejected. 
It was subsequentiy published in 
Enzymologia. Dr. Krebs proudiy 
displayed the rejection letter 
throughout his career as 
encouragement for young scientists. 

"June 1937 

The editor of NATURE presents his 
compliments to Dr. H. A. Krebs and 
regrets that as he has aiready sufficient 
letters to fili the correspondence 
columns of NATURE for seven or eight 
weeks, it is undesirable to accept 
further letters at the present time on 
account of the time delay which must 
occur in their publication. 

If Dr. Krebs does not mind much delay 
the editor is prepared to keep the 
letter until the congestion is relieved in 
the hope of making use of it. 

He returns it now, in case Dr. Krebs 
prefers to submit it for early 
publication to another periodical." 


pathologies. The proverbial phrase ‘‘mad as a hatter'' refers to the strange 
behavior of poisoned hat makers who used mercury nitrate to soften and 
shape animal furs. This form of mercury is absorbed through the skin. Sim¬ 
ilar problems afflicted the early photographers, who used vaporized mer¬ 
cury to create daguerreotypes. 

Treatment for these poisons is the administration of sulfhydryl reagents 
with adjacent sulfhydryl groups to compete with the dihydrolipoyl residues 
for binding with the metal ion, which is then excreted in the urine. Indeed, 
2,3-dimercaptopropanol (see Figure 17.20) was developed after World War 
I as an antidote to lewisite, an arsenic-based chemical weapon. This com- 
pound was initially called BAL, for British anti-lewisite. 


17.3.3 Speculations on the Evolutionary History of the 
Citric Acid Cycle 

Y How did the citric acid cycle come into being? Although definitive 
answers are elusive, it is nevertheless instructive to speculate how this 
complicated central hub of metabolism developed. We can perhaps begin to 
comprehend how evolution might work at the levei of biochemical pathways. 

It is most likely that the citric acid cycle was assembled from preex- 
isting reaction pathways. As noted earlier, many of the intermediates 
formed in the citric acid cycle are used in biosynthetic pathways to gen- 
erate amino acids and porphyrins. Thus, compounds such as pyruvate, 
a-ketoglutarate, and oxaloacetate were likely present early in evolution for 
biosynthetic purposes. The oxidative decarboxylation of these a-ketoacids 
is quite favorable thermodynamically. The elegant modular structures of 
the pyruvate and a-ketoglutarate dehydrogenase complexes reveal how 
three reactions (decarboxylation, oxidation, and thioester formation) can 
be linked to harness the free energy associated with decarboxylation to 
drive the synthesis of both acyl CoA derivatives and NADH. These re¬ 
actions almost certainly formed the core of processes that preceded the cit¬ 
ric acid cycle evolutionarily. Interestingly, a-ketoglutarate can be directly 
converted into oxaloacetate by transamination of the respective amino 
acids by aspartate aminotransferase, another key biosynthetic enzyme. 
Thus, cycles comprising smaller numbers of intermediates could have ex- 
isted before the present form evolved to harvest the electrons from pyru¬ 
vate or other compounds more efficiently. 


17.4 THE GLYOXYLATE CYCLE ENABLES PLANTS AND 
BACTÉRIA TO GROW ON ACETATE 


Many bactéria and plants are able to subsist on acetate or other compounds 
that yield acetyl CoA. They make use of a metabolic pathway absent in most 
other organisms that converts two-carbon acetyl units into four-carbon 
units (succinate) for energy production and biosyntheses. This reaction se- 
quence, called the glyoxylate cycle, bypasses the two decarboxylation steps 
of the citric acid cycle. Another key difference is that two molecules of acetyl 
CoA enter per turn of the glyoxylate cycle, compared with one in the citric 
acid cycle. 

The glyoxylate cycle (Figure 17.21), like the citric acid cycle, begins with 
the condensation of acetyl CoA and oxaloacetate to form citrate, which is 
then isomerized to isocitrate. Instead of being decarboxylated, isocitrate is 
cleaved by isocitrate lyase into succinate and glyoxylate. The subsequent 
steps regenerate oxaloacetate from glyoxylate. Acetyl CoA condenses with 
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FIGURE 17.21 The glyoxylate pathway. The glyoxylate cycle allows plants and some 
microorganisms to grow on acetate because the cycle bypasses the decarboxylation steps of 
the citric acid cycle. The enzymes that permit the conversion of acetate into succinate— 
isocitrate lyase and malate synthase—are boxed in blue. 


glyoxylate to form malate in a reaction catalyzed by malate synthase, which 
resembles citrate synthase. Finally, malate is oxidized to oxaloacetate, as in 
the citric acid cycle. The sum of these reactions is: 

2 Acetyl CoA + NAD+ + 2 H 2 O-> 

succinate + 2 CoA + NADH + 2 

In plants, these reactions take place in organelles called glyoxysomes. Succi¬ 
nate, released midcycle, can be converted into carbohydrates by a combi- 
nation of the citric acid cycle and gluconeogenesis. Thus, organisms with 
the glyoxylate cycle gain a metabolic versatility. 

Bactéria and plants can synthesize acetyl CoA from acetate and CoA by 
an ATP-driven reaction that is catalyzed by acetyl CoA synthetase. 

Acetate + CoA + ATP-> acetyl CoA + AMP + PP^ 

Pyrophosphate is then hydrolyzed to orthophosphate, and so the equiv- 
alents of two compounds having high phosphoryl transfer potential are con- 
sumed in the activation of acetate. We will return to this type of activation 
reaction in fatty acid degradation (Section 22.2.2), where it is used to form 
fatty acyl CoA, and in protein synthesis, where it is used to link amino acids 
to transfer RNAs (Section 29.x.x). 







SUMMARY 
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The citric acid cycle is the final common pathway for the oxidation of 
fuel molecules. It also serves as a source of building blocks for biosyn- 
theses. Most fuel molecules enter the cycle as acetyl CoA. The link be- 
tween glycolysis and the citric acid cycle is the oxidative decarboxyla- 
tion of pyruvate to form acetyl CoA. In eukaryotes, this reaction and 
those of the cycle take place inside mitochondria, in contrast with gly¬ 
colysis, which takes place in the cytosol. 

The Citric Acid Cycle Oxidizes Two-Carbon Units 

The cycle starts with the condensation of oxaloacetate (C4) and acetyl 
CoA (C2) to give citrate (Ce), which is isomerized to isocitrate (Ce). 
Oxidative decarboxylation of this intermediate gives a-ketoglutarate 
(C5). The second molecule of carbon dioxide comes off in the next 
reaction, in which a-ketoglutarate is oxidatively decarboxylated to suc- 
cinyl CoA (C4). The thioester bond of succinyl CoA is cleaved by in- 
orthophosphate to yield succinate, and a high phosphoryl transfer po- 
tential compound in the form of GTP is concomitantly generated. 
Succinate is oxidized to fumarate (C4), which is then hydrated to form 
malate (C4). Finally, malate is oxidized to regenerate oxaloacetate (C4). 
Thus, two carbon atoms from acetyl CoA enter the cycle, and two 
carbon atoms leave the cycle as CO 2 in the successive decarboxylations 
catalyzed by isocitrate dehydrogenase and a-ketoglutarate dehydro- 
genase. In the four oxidation-reduction reactions in the cycle, three 
pairs of electrons are transferred to NAD^ and one pair to FAD. These 
reduced electron carriers are subsequently oxidized by the electron- 
transport chain to generate approximately 9 molecules of ATP. In 
addition, 1 molecule of a compound having a high phosphoryl transfer 
potential is directly formed in the citric acid cycle. Hence, a total of 
10 molecules of compounds having high phosphoryl transfer potential 
are generated for each two-carbon fragment that is completely oxidized 
to H 2 O and CO 2 . 

Entry to the Citric Acid Cycle and Metabolism Through It Are Controlled 

The citric acid cycle operates only under aerobic conditions because it 
requires a supply of NAD^ and FAD. The irreversible formation of 
acetyl CoA from pyruvate is an important regulatory point for the en¬ 
try of glucose-derived pyruvate into the citric acid cycle. The activity 
of the pyruvate dehydrogenase complex is stringently controlled by re- 
versible phosphorylation. The electron acceptors are regenerated when 
NADH and FADH 2 transfer their electrons to O 2 through the electron- 
transport chain, with the concomitant production of ATP. Conse- 
quently, the rate of the citric acid cycle depends on the need for ATP. 
In eukaryotes, the regulation of two enzymes in the cycle also is im¬ 
portant for control. A high energy charge diminishes the activities of 
isocitrate dehydrogenase and a-ketoglutarate dehydrogenase. These 
mechanisms complement each other in reducing the rate of formation 
of acetyl CoA when the energy charge of the cell is high and when 
biosynthetic intermediates are abundant. 

The Citric Acid Cycle Is a Source of Biosynthetic Precursors 

When the cell has adequate energy available, the citric acid cycle can 
also provide a source of building blocks for a host of important bio- 
molecules, such as nucleotide bases, proteins, and heme groups. This 
use depletes the cycle of intermediates. When the cycle again needs to 
metabolize fuel, anaplerotic reactions replenish the cycle intermediates. 






Selected Readings 


• The Glyoxylate Cycle Enables Plants and Bactéria to Grow on Acetate 

The glyoxylate cycle enhances the metabolic versatility of many plants 
and bactéria. This cycle, which uses some of the reactions of the citric 
acid cycle, enables these organisms to subsist on acetate because it 
bypasses the two decarboxylation steps of the citric acid cycle. 
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-1 PROBLEMS I- 

1. Flow of carbon atoms. What is the fate of the radioactive la¬ 
bei when each of the foliowing compounds is added to a cell ex- 
tract containing the enzymes and cofactors of the glycolytic 
pathway, the citric acid cycle, and the pyruvate dehydrogenase 
complex? (The labei is printed in red.) 
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(a) C (b) C 

XOO- XOO- 
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X. 


XOO- 
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(e) Glucose 6-phosphate labeled at C-1. 


as does thiamine pyrophosphate, and it competitively inhibits 
the enzyme. Why? 


H.C 


R' 



TPP 



Thiazolone analog 
of TPP 


7. Lactic acidosis. Patients in shock will often suffer from lac- 
tic acidosis due to a deficiency of O 2 . Why does a lack of O 2 lead 
to lactic acid accumulation? One treatment for shock is to ad- 
minister dichloroacetate, which inhibits the kinase associated 
with the pyruvate dehydrogenase complex. What is the bio- 
chemical rationale for this treatment? 


2. C2 + C2-> C4. 

(a) Which enzymes are required to get net synthesis of oxaloac- 
etate from acetyl CoA? 

(b) Write a balanced equation for the net synthesis. 

(c) Do mammalian cells contain the requisite enzymes? 

3. Driving force. What is the AG°' for the complete oxidation 
of the acetyl unit of acetyl CoA by the citric acid cycle? 

4. Acting catalytically. The citric acid cycle itself, which is com- 
posed of enzymatically catalyzed steps, can be thought of essen- 
tially as the product of a supramolecular enzyme. Explain. 

5. Probing stereospecificity. A sample of deuterated reduced 
NAD was prepared by incubating H3C-CD2-OH and NAD^ 
with alcohol dehydrogenase. This reduced coenzyme was added 
to a solution of 1,3-BPG and glyceraldehyde 3-phosphate dehy¬ 
drogenase. The NAD^ formed by this second reaction contained 
one atom of deuterium, whereas glyceraldehyde 3-phosphate, 
the other product, contained none. What does this experiment 
reveal about the stereospecificity of glyceraldehyde 3-phosphate 
dehydrogenase? 

6. A potent inhibitor. Thiamine thiazolone pyrophosphate 
binds to pyruvate dehydrogenase about 20,000 times as strongly 


8. Coupling reactions. The oxidation of malate by NAD^ to 
form oxaloacetate is a highly endergonic reaction under standard 
conditions [AG°' = + 7 kcal moP^ (+ 29 kj moP^)]. The re¬ 
action proceeds readily under physiological conditions. 

(a) Why? 

(b) Assuming an [NAD^]/[NADH] ratio of 8 and a pH of 7, 
what is the lowest [malate]/[oxaloacetate] ratio at which ox¬ 
aloacetate can be formed from malate? 

9. Synthesizing a-ketoglutarate. It is possible, with the use of 
the reactions and enzymes discussed in this chapter, to convert 
pyruvate into a-ketoglutarate without depleting any of the cit¬ 
ric acid cycle components. Write a balanced reaction scheme for 
this conversion, showing cofactors and identifying the required 
enzymes. 

Chapter Integration Problem 

10. Fats into glucose? Fats are usually metabolized into acetyl 
CoA and then further processed through the citric acid cycle. In 
Chapter 16, we learned that glucose could be synthesized from 
oxaloacetate, a citric acid cycle intermediate. Why, then, after 
a long bout of exercise depletes our carbohydrate Stores, do 
we need to replenish those Stores by eating carbohydrates? 
Why do we not simply replace them by converting fats into 
carbohydrates? 





Mechanism Problems 

11. Theme and variation. Propose a reaction mechanism for the 
condensation of acetyl Co A and glyoxylate in the glyoxylate cy- 
cle of plants and bactéria. 

12. Symmetry problems. In experiments carried out in 1941 to in- 
vestigate the citric acid cycle, oxaloacetate labeled with in 
the carboxyl carbon atom farthest from the keto group was in- 
troduced to an active preparation of mitochondria. 


CH2 

COO- 

Oxaloacetate 

Analysis of the a-ketoglutarate formed showed that none of the 
radioactive labei had been lost. Decarboxylation of a-ketoglu¬ 
tarate then yielded succinate devoid of radioactivity. All the labei 
was in the released CO 2 . Why were the early investigators of the 
citric acid cycle surprised that all the labei emerged in the CO 2 ? 

13. Symmetric molecules reacting asymmetrically The interpreta- 
tion of the experiments described in problem 12 was that citrate 
(or any other symmetric compound) cannot be an intermediate 
in the formation of a-ketoglutarate, because of the asymmetric 
fate of the labei. This view seemed compelling until Alexander 
Ogston incisively pointed out in 1948 that ‘ht is possible that an 
asymmetric enzyme which attacks a symmetrical compound can dis- 
tinguish between its identical groups.'' For simplicity, consider a 
molecule in which two hydrogen atoms, a group X, and a dif- 
ferent group Y are bonded to a tetrahedral carbon atom as a 
model for citrate. Explain how a symmetric molecule can react 
with an enzyme in an asymmetric way. 

Data Interpretation 

14. A little goes a long way. As will become clearer in Chapter 
18, the activity of the citric acid cycle can be monitored by mon- 
itoring the amount of O 2 consumed. The greater the rate of O 2 
consumption, the faster the rate of the cycle. Hans Krebs used 
this assay to investigate the cycle in 1937. He used as his exper¬ 
imental System minced pigeon breast muscle, which is rich in 
mitochondria. In one set of experiments, Krebs measured the O 2 
consumption in the presence of carbohydrate only and in the 
presence of carbohydrate and citrate. The results are shown in 
the foliowing table. 


Effect of citrate on oxygen consumption by minced 
pigeon breast muscle 


Time 

(min) 

Micromoles of 

oxygen consumed 

Carbohydrate 

only 

Carbohydrate 
plus 3 ixmol 
of citrate 

10 

26 

28 

60 

43 

62 

90 

46 

77 

150 

49 

85 


Problems - 1 489 |- 

(a) How much O 2 would be absorbed if the added citrate were 
completely oxidized to H 2 O and CO 2 ? 

(b) Based on your answer to part u, what do the results given in 
the table suggest? 

15. Arsenite poisoning. The effect of arsenite on the experimen¬ 
tal System of problem 14 was then examined. Experimental data 
(not presented here) showed that the amount of citrate present 
did not change in the course of the experiment in the absence of 
arsenite. However, if arsenite was added to the system, different 
results were obtained, as shown in the following table. 


Disappearance of citric acid in pigeon breast muscle 
in the presence of arsenite 


Micromoles 

Micromoles of 

Micromoles 

of citrate 

citrate found after 

of citrate 

added 

40 minutes 

used 

22 

0.6 

21 

44 

20 

24 

90 

56 

34 


(a) What is the effect of arsenite on the disappearance of 
citrate? 

(b) How is the arsenite’s action altered by the addition of more 
citrate? 

(c) What do these data suggest about the site of action of 
arsenite? 

16. Isocitrate lyase and tuberculosis. The bacterium Mycobact- 
erium tuberculosis, the cause of tuberculosis, can invade the lungs 
and persist in a latent State for years. During this time, the bac¬ 
téria reside in granulomas—nodular scars containing bactéria 
and host-cell debris in the center and surrounded by immune 
cells. The granulomas are lipid-rich, oxygen-poor environments. 
How these bactéria manage to persist is something of a mystery. 
The results of recent research suggest that the glyoxylate cycle 
is required for the persistence. The following data show the 
amount of bactéria [presented as colony-forming units (cfu)] in 
mice lungs in the weeks after an infection. 

In graph A, the black circles represent the results for wild- 
type bactéria and the red circles represent the results for bacté¬ 
ria from which the gene for isocitrate lyase was deleted. 


(A) 



(a) What is the effect of the absence of isocitrate lyase? 

The techniques discussed in Chapter 6 were used to reinsert the 
gene encoding isocitrate lyase into bactéria from which it had 
previously been deleted. 
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In graph B, Black circles represent bactéria into which the 
gene was reinserted and red circles bactéria from which the gene 
was still missing. 

(b) Do these results support those obtained in part < 2 ? 

(c) What is the purpose of the experiment in part b? 

(d) Why do these bactéria perish in the absence of the glyoxy- 
late cycle? 


% Need extra help? Purchase chapters of the Student 
Companion with complete Solutions online at 
www.whfreeman.com/biochem5. 


(B) Isocitrate lyase 



[Data after McKinney et al., 2000. Nature 406:735-738.] 
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The Calvin Cycle and the Pentose 
Phosphate Pathway 




Year 

Atmospheríc carbon dioxide measurements at Mauna Loa, 
Hawaii. These measurements show annual cycles resulting from 
seasonal variation in carbon dioxide fixation by the Calvin cycle 
in terrestrial plants. Much of this fixation takes place in rain forests, 
which account for approximately 50% of terrestrial fixation. 

[Dennis Potokar/Photo Researchers.] 



OUTLINE 


Photosynthesis proceeds in two parts: the light reactions and the dark reac- 
tions. The light reactions, which were discussed in Chapter 19, transform 
light energy into ATP and biosynthetic reducing power, NADPH. The dark 
reactions, which constitute the Calvin cycle, named after Melvin Calvin, 
the biochemist who elucidated the pathway, reduce carbon atoms from their 
fully oxidized State as carbon dioxide to the more reduced State as a hexose. 

The components of the Calvin cycle and called the dark re- _ 

actions because, in contrast with the light reactions, these 
reactions do not directly depend on the presence of light. 

In addition to ATP, the dark reactions require reducing 
power in the form of NADPH, the currency of readily avail- 
able reducing power in cells. The phosphoryl group on the 
2'-hydroxyl group of one of the ribose units of NADPH 
distinguishes NADPH from NADH. There is a funda¬ 
mental distinction between NADPH and NADH in biochem- 
istry: NADH is oxidized by the respiratory chain to generate 
ATP, whereas NADPH serves as a reductant in biosynthetic 
processes. 

The second half of this chapter examines a pathway 
common to all organisms, known variously as the pentose 
phosphate pathway, the hexose monophosphate pathway, 
the phosphogluconate pathway, or the pentose shunt. The 
pathway provides a means by which glucose can be oxidized 
to generate NADPH and is the source of much of the 
NADPH that is needed for the biosynthesis of many bio- 
molecules, most notably fats. We will observe the use of 


20.1 The Calvin Cycle Synthesizes 
Hexoses from Carbon Dioxide and Water 

20.2 The Activity of the Calvin Cycle 
Depends on Environmental Conditions 

20.3 The Pentose Phosphate Pathway 
Generates NADPH and Synthesizes 
Five-Carbon Sugars 

20.4 The Metabolism of Glucose 6- 
phosphate by the Pentose Phosphate 
Pathway Is Coordinated with Glycolysis 

20.5 Glucose 6-phosphate 
Dehydrogenase Plays a Key Role 
in Protection Against Reactive 
Oxygen Species 
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CHAPTER 20 • The Calvin Cycle and the 

Pentose Phosphate Pathway 


NADPH in many of the biosynthetic reactions considered in Part III of this 
text. The pentose phosphate pathway can also be used for the catabolism of 
pentose sugars from the diet, the synthesis of pentose sugars for nucleotide 
biosynthesis, and the catabolism and synthesis of less common four- and 
seven-carbon sugars. The pentose phosphate pathway and the Calvin cycle 
have in common several enzymes and intermediates that attest to an evolu- 
tionary kinship. Like glycolysis and gluconeogenesis (Chapter 16), these 
pathways are mirror images of one another: the Calvin cycle uses NADPH 
to reduce carbon dioxide to generate hexoses, whereas the pentose phosphate 
pathway breaks down glucose into carbon dioxide to generate NADPH. 
The Calvin cycle is sometimes referred to as the reductive pentose phosphate 
pathway. 


20.1 THE CALVIN CYCLE SYNTHESIZES HEXOSES 
FROIVI CARBON DIOXIDE AND WATER 

We saw in Chapter 16 that glucose could be formed from noncarbohydrate 
precursors, such as lactate and amino acids, by gluconeogenesis. The syn¬ 
thesis of glucose from these compounds is simplified because the carbons 
are already incorporated into relatively complex organic molecules. In con- 
trast, the source of the carbon atoms in the Calvin cycle is the simple mol- 
ecule carbon dioxide. In this extremely important process, carbon dioxide 
gas is trapped in a form that is useful for many processes. The Calvin cycle 
brings into living systems the carbon atoms that will become constituents 
of nucleic acids, proteins, and fats. Photosynthetic organisms are called 
autotrophs (literally ‘‘self-feeders’') because they can synthesize glucose from 
carbon dioxide and water, by using sunlight as an energy source, and then 
reco ver some of this energy from the synthesized glucose through the gly- 
colytic pathway and aerobic metabolism. Organisms that obtain energy 
from Chemical fuels only are called heterotrophs, which ultimately depend 
on autotrophs for their fuel. The Calvin cycle also differs from gluconeo¬ 
genesis in where it takes place in photosynthetic eukaryotes. Whereas glu¬ 
coneogenesis takes place in the cytoplasm, the Calvin cycle takes place in 
the stroma of chloroplasts, the photosynthetic organelles. 


FIGURE 20.1 Calvin cycle. The Calvin 
cycle consists of three stages. Stage 1 is 
the fixation of carbon by the carboxylation 
of ribulose 1,5-bisphosphate. Stage 2 is 
the reduction of the fixed carbon to begin 
the synthesis of hexose. Stage 3 is the 
regeneration of the starting compound, 
ribulose 1,5-bisphosphate. 



\ 

\ 

\ 

\ 

\ 

\ 
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Ribulose 

5-phosphate 


Fructose 

6-phosphate 


À, 


STAGE 2: Reduction 


Glyceraldehyde 

3-phosphate 


ATP 


Ribulose 

1,5-bisphosphate 


2 

3-phospho- 

glycerate 



1,3-bisphospho- 

glycerate 


2 ATP 



2 NADPH 


STAGE 1: 
Fixation 
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The Calvin cycle comprises three stages (Figure 20.1): 

1. The fixation of CO 2 by ribulose 1,5 -bisphosphate to form two molecules 
of 3-phosphoglycerate. 

2. The reduction of 3-phosphoglycerate to form hexose sugars. 

3. The regeneration of ribulose 1,5-bisphosphate so that more CO 2 can be 
fixed. 

Although we will focus on the Calvin cycle, other means of fixing carbon 
dioxide into hexose sugars exist in the photosynthetic world, notably a ver- 
sion of the citric acid cycle running in rever se. 
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The Calvin Cycle 


20.1.1 Carbon Dioxide Reacts with Ribulose 1,5-bisphosphate 
to Form Two Molecules of 3-Phosphoglycerate 

The first step in the Calvin cycle is the fixation of CO 2 . The CO 2 molecule 
condenses with ribulose 1,5-bisphosphate to form an unstable six-carbon com- 
pound, which is rapidly hydrolyzed to two molecules of 3-phosphoglycerate. 


CH 20 P 032 - 

CH 2 OPO 32 - 


C=0 CO 2 

HO-C- COO- H 2 O 

CH 2 OPO 32 

1 1 /- r\i 1 — 



H L UH 

— > L - U - 

^ MU U M 

H—C—OH 

H—C-OH 

CO2- 

CH 2 OPO 32 - 

CH 2 OPO 32 - 


Ribulose 

Unstable 

3-Phosphoglycerate 

1,5-bisphosphate 

intermediate 



The initial incorporation of CO 2 into 3-phosphoglycerate was revealed 
through the use of a carbon-14 radioactive tracer (Figure 20.2). This highly 
exergonic reaction [AG°' = —12.4 kcal moP^ ( — 51.9 kj moP^)] is cat- 
alyzed by ribulose 1,5-bisphosphate carboxylase/oxygenase (usually called 
rubisco), an enzyme located on the stromal surface of the thylakoid mem- 
branes of chloroplasts. This important reaction is the rate-limiting step in 
hexose synthesis. Rubisco in chloroplasts consists of eight large (L, 55-kd) 
subunits and eight small (S, 13-kd) ones (Figure 20.3). Each L chain con- 
tains a catalytic site and a regulatory site. The S chains enhance the catalytic 


S chain L chain 



3-Phosphoglycerate 



5 s 



FIGURE 20.2 Tracing the fate of 
carbon dioxide. Radioactivity from ^^C 02 
is incorporated into 3-phosphoglycerate 
within 5 s in irradiated cultures of algae. 
After 60 s, the radioactivity appears in 
many compounds, the intermediates 
within the Calvin cycle. [Courtesy of 
Dr. J. A. Bassham.] 


FIGURE 20.3 Structure of 
rubisco. The enzyme ribulose 
1,5-bisphosphate carboxylase/oxygenase 
(rubisco) comprises eight large subunits 
(one shown in red and the others in 
yellow) and eight small subunits (one 
shown in blue and the others in white). 
The active sites lie in the large subunits. 
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Glu 

n n 



FIGURE 20.4 Role of the magnesium 
íon ín the rubísco mechanism. 

Ribulose 1,5-bisphosphate binds to a 
magnesium ion that is linked to rubisco 
through a glutamate residue, an aspartate 
residue, and the lysine carbamate. The 
coordinated ribulose 1,5-bisphosphate 
gives up a proton to form a reactive 
enediolate species that reacts with CO 2 
to form a new carbon-carbon bond. 




activity of the L chains. This enzyme is very abundant in chloroplasts, con- 
stituting more than 16% of their total protein. In fact, rubisco is the most 
abundant enzyme and probably the most abundant protein in the biosphere. 
Large amounts are present because rubisco is a slow enzyme; its maximal 
catalytic rate is only 3 s~b 

Rubisco requires a bound divalent metal ion for activity, usually mag¬ 
nesium ion. Like the zinc ion in the active site of carbonic anhydrase (Sec- 
tion 9.2.1), this metal ion serves to activate a bound substrate molecule by 
stabilizing a negative charge. Interestingly, a CO 2 molecule other than the 
substrate is required to complete the assembly of the Mg^^ binding site in 
rubisco. This CO 2 molecule adds to the uncharged e-amino group of lysine 
201 to form a carbamate. This negatively charged adduct then binds the 
Mg^^ ion. The formation of the carbamate is facilitated by the enzyme 
rubisco activase, although it will also form spontaneously at a lower rate. 

The metal center plays a key role in binding ribulose 1,5-bisphosphate 
and activating it so that it will react with CO 2 (Figure 20.4). Ribulose 1,5- 
bisphosphate binds to Mg^^ through its keto group and an adjacent hy- 
droxyl group. This complex is readily deprotonated to form an enediolate 
intermediate. This reactive species, analogous to the zinc-hydroxide species 
in carbonic anhydrase (Section 9.2.2), couples with CO 2 , forming the new 


CH2OPO32- 

I 

c=o 

I 

H—C—OH 

I 

H—C—OH 

I 

CH20P03^“ 


H+ 



CH2OPO32- 

I 

c—o- 

II 

C—OH 


CO 2 



H—C—OH 

I 

CH 2 OPO 32 - 


Ribulose 

1,5-bisphosphate 


Enediolate 

intermediate 


CH2OPO32- 

I 

HO—C— COO" 

I 

c=o 

I 

H—C—OH 

I 

CH 2 OPO 32 - 



2-Carboxy-3-keto- 

D-arabinitol 

1,5-bisphosphate 


CH 2 OPO 32 - 

I 

HO—C— COO" H 
HO —C— OH 

I 

H—C—OH 

I 

CH 2 OPO 32 - 

Hydrated 
intermediate 


CH2OPO32- 
HO—C— H 
COO" 

3-Phosphoglycerate 

+ 

COO- 

I 

H C—OH 

I 

CH2OPO32- 

3-Phosphoglycerate 


2 H+ 


FIGURE 20.5 Formation of 3-phosphoglycerate. The overall pathway for the conversion 
of ribulose 1,5 bisphosphate and CO 2 into two molecules of 3-phosphoglycerate. Although 
the free species are shown, these steps take place on the magnesium ion. 
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carbon-carbon bond. The resulting product is coordinated to the ion 

through three groups, including the newly formed carboxylate. A molecule 
of H 2 O is then added to this |3-ketoacid to form an intermediate that cleaves 
to form two molecules of 3-phosphoglycerate (Figure 20.5). 


20.1.2 Catalytic Imperfection: Rubisco Aiso Catalyzes 
a WastefuI Oxygenase Reaction 

The reactive intermediate generated on the Mg^^ ion sometimes reacts with 
O 2 instead of CO 2 . Thus, rubisco also catalyzes a deleterious oxygenase re¬ 
action. The Products of this reaction are phosphoglycolate and 3-phospho¬ 
glycerate (Figure 20.6). The rate of the carboxylase reaction is four times 


CH2OPO32- 

I 

c=o 

I 

H—C—OH 

I 

H—C—OH 

I 

CH2OPO32- 

Ribulose 

1,5-bisphosphate 



CH2OPO32- 

I 

C—O- 

II 

C—OH 

I 

H—C—OH 

I 

CH 2 OPO 32 - 



Enediolate 

intermediate 


CH 2 OPO 32 


+ 


O—Ç— o — OH H 2 O H 2 O 


c=o 

H—C—OH 
CH2OPO32- 


Hydroperoxide 

intermediate 


CH 2 OPO 32 - 



Phosphoglycolate 

+ 

COO" 

I 

H—C—OH 

I 

CH20P03^‘ 

3-Phosphoglycerate 


FIGURE 20.6 A wastefuI síde reaction. The reactive enediolate intermediate on rubisco 
also reacts with molecular oxygen to form a hydroperoxide intermediate, which then pro- 
ceeds to form one molecule of 3-phosphoglycerate and one molecule of phosphoglycolate. 


that of the oxygenase reaction under normal atmospheric conditions at 
25°G; the stromal concentration of CO 2 is then 10 jiiM and that of O 2 is 
250 |ulM. The oxygenase reaction, like the carboxylase reaction, requires that 
lysine 201 be in the carbamate form. Because this carbamate forms only in 
the presence of CO 2 , this property would prevent rubisco from catalyzing 
the oxygenase reaction exclusively when CO 2 is absent. 

Phosphoglycolate is not a versatile metabolite. A sal vage pathway re- 
covers part of its carbon skeleton (Figure 20.7). A specific phosphatase con- 
verts phosphoglycolate into glycolate, which enters peroxisomes (also called 
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CHLOROPLAST 



FIGURE 20.7 Photorespiratory 
reactions. Phosphoglycolate is formed 
as a product of the oxygenase reaction 
in chioroplasts. After dephosphorylation, 
glycolate is transported into peroxisomes 
where it is converted into glyoxylate and 
then glycine. In mitochondria, two glycines 
are converted into serine, after losing 
a carbon as CO 2 and ammonia. The 
ammonia is salvaged in chioroplasts. 
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500 nm 


FIGURE 20.8 Electron micrograph 
of a peroxísome nestied between 
two chioroplasts. [Courtesy of Dr. Sue 
Ellen Frederick.] 
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2 Glyceraldehyde _^ Dihydroxyacetone 

3-phosphate ' phosphate 

NADP+ 

NADPH 

2 1,3-Bisphosphoglycerate 
ADP 

ATP 

2 3-Phosphoglycerate 

FIGURE 20.9 Hexose phosphate 
formatíon. 3-Phosphoglycerate is 
converted into fructose 6 -phosphate 
in a pathway parallel to that of 
glyconeogenesis. 




microbodies; Figure 20.8). Glycolate is then oxidized to glyoxylate by glyco- 
late oxidase, an enzyme with a flavin mononucleotide prosthetic group. The 
H 2 O 2 produced in this reaction is cleaved by catalase to H 2 O and O 2 . 
Transamination of glyoxylate then yields glycine. Two glycine molecules can 
be used to form serine, a potential precursor of glucose, with the release 
of CO 2 and ammonia (NH 4 ^). The ammonia, used in the synthesis of 
nitrogen-containing compounds, is salvaged by glutamine synthetase reaction. 

This salvage pathway serves to recycle three of the four carbon atoms of 
two molecules of glycolate. However, one carbon atom is lost as CO 2 . This 
process is called photorespiration because O 2 is consumed and CO 2 is re- 
leased. Photorespiration is wasteful because organic carbon is converted into 
CO 2 without the production of ATP, NADPH, or another energy-rich 
metabolite. Moreover, the oxygenase activity increases more rapidly with 
temperature than the carboxylase activity, presenting a problem for trop¬ 
ical plants (Section 20.2.3). Evolutionary processes have presumably en- 
hanced the preference of rubisco for carboxylation. For instance, the rubisco 
of higher plants is eightfold as specific for carboxylation as that of photo- 
synthetic bactéria. 

20.1.3 Hexose Phosphates Are Made from Phosphoglycerate, 
and Ribulose 1,5-bisphosphate Is Regenerated 

The 3-phosphoglycerate product of rubisco is next converted into three 
forms of hexose phosphate: glucose 1-phosphate, glucose 6-phosphate, and 
fructose 6-phosphate. Recall that these isomers are readily interconvertible 
(Sections 16.1.2 and 16.1.11). The steps in this conversion (Figure 20.9) are 
like those of the gluconeogenic pathway (Section 16.3.1), except that glyc¬ 
eraldehyde 3-phosphate dehydrogenase in chioroplasts, which generates 
glyceraldehyde 3-phosphate (GAP), is specific for NADPH rather than 
NADH. Alternatively, the glyceraldehyde 3-phosphate can be transported 
to the cytosol for glucose synthesis. These reactions and that catalyzed by 
rubisco bring CO 2 to the levei of a hexose, converting CO 2 into a chemical 
fuel at the expense of NADPH and ATP generated from the light reactions. 

The third phase of the Calvin cycle is the regeneration of ribulose 1,5- 
bisphosphate, the acceptor of CO 2 in the first step. The problem is to con- 
struct a five-carbon sugar from six-carbon and three-carbon sugar s. A 
transketolase and an aldolase play the major role in the rearrangement of 
the carbon atoms. The transketolase, which we will see again in the pentose 
phosphate pathway (Section 20.2.3), requires the coenzyme thiamine py- 
rophosphate (TPP) to transfer a two-carbon unit (CO-CH 2 OH) from a ke- 
tose to an aldose. 





0 ^ ^CH20H 
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0^ H 
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+ HO -C -H 


R' 

R 

1 

R 



R' 

Ketose 

Aldose 

Aldose 
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(n carbons) 
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(n - 2 carbons) 

(m + 2 carbons) 


We will consider the mechanism of transketolase when we meet it again 
in the pentose phosphate pathway (Section 20.3.2). Aldolase, which we have 
already encountered in glycolysis (Section 16.1.3), catalyzes an aldol con- 
densation between dihydroxyacetone phosphate and an aldehyde. This en¬ 
zyme is highly specific for dihydroxyacetone phosphate, but it accepts a 
wide variety of aldehydes. 
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With these enzymes, the construction of the five-carbon sugar proceeds as 
shown in Figure 20.10. 

Finally, ribose-5-phosphate is converted into ribulose 5-phosphate by 
phosphopentose isomerase while xylulose 5-phosphate is converted into ribu¬ 
lose 5-phosphate hy phosphopentose epimerase. Ribulose 5-phosphate is con¬ 
verted into ribulose 1,5-bisphosphate through the action of phosphoribulose 
kinase (Figure 20.11). The sum of these reactions is 

Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 

+ dihydroxyacetone phosphate + 3 ATP-> 

3 ribulose 1,5-bisphosphate + 3 ADP 
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0 ^ XH 2 OH 

FIGURE 20.10 Formatíon of five- 
carbon sugars. First, transketolase 

H—C- OH 

HO -C—H 

converts a six-carbon sugar and a 
three-carbon sugar into a four-carbon 

H—C-OH 

H—C-OH 

sugar and a five-carbon sugar. Then, 
aldolase combines the four-carbon 

H—C-OH 

CH 2 OPO 32 - 

product and a three-carbon sugar to 


form a seven-carbon sugar. Finally, this 

CH 20 P 03 ^ 


seven-carbon fragment combines with 

Ribose 

Xylulose 

another three-carbon fragment to form 

5-phosphate 

5-phosphate 

two additional five-carbon sugars. 
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FIGURE 20.11 Regeneration of 
ríbulose 1,5-bísphosphate. Both ribose 
5-phosphate and xylulose 5-phosphate are 
converted into ribulose 5-phosphate, which 
is then phosphorylated to complete the 
regeneration of ribulose 1,5-bisphosphate. 
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H—C—OH 
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O. .CH 2 OH 
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H—C—OH 

Phosphoribulose 
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Phosphopentose 

CH2OPO32- 


CH2OPO32- 

epimerase^^ 

Ribulose 


Ribulose 


5-phosphate 


1,5-bisphosphate 


Xylulose 5-phosphate 



FIGURE 20.12 Calvin cycle. The diagram shows the reactions necessary with the correct 
stoichiometry to convert three molecules of CO 2 into one molecule of DHAP. The cycle is 
not as simple as presented in Figure 20.1; rather, it entails many reactions that lead ulti- 
mately to the synthesis of glucose and the regeneration of ribulose 1,5-bisphosphate. [After 
J. R. Bowyer and R. C. Leegood. "Photosynthesis," in Plant Biochemistry, P. M. Dey and J. B. Harborne, 
Eds. (Academic Press, 1997), p. 85.] 


This series of reactions completes the Calvin cycle (Figure 20.12). The sum 
of all the reactions results in the generation of a hexose and the regenera¬ 
tion of the starting compound, ribulose 5-phosphate. In essence, ribulose 
1,5-bisphosphate acts catalytically, similarly to oxaloacetate in the citric acid 
cycle. 




























7552d_c20_551-576 8/28/01 


6:47 AM Page 559 mac45 Mac 45:lst 




ft: 


20.1.4 Three Molecules of ATP and Two Molecules of NADPH 
Are Used to Bring Carbon Dioxide to the Levei of a Hexose 

What is the energy expenditure for synthesizing a hexose? Six rounds of the 
Calvin cycle are required, because one carbon atom is reduced in each round. 
Twelve molecules of ATP are expended in phosphorylating 12 molecules 
of 3-phosphoglycerate to 1,3-bisphosphoglycerate, and 12 molecules of 
NADPH are consumed in reducing 12 molecules of 1,3-bisphosphoglycer- 
ate to glyceraldehyde 3-phosphate. An additional six molecules of ATP are 
spent in regenerating ribulose 1,5-bisphosphate. We can now write a bal- 
anced equation for the net reaction of the Calvin cycle. 

6 CO 2 + 18 ATP + 12 NADPH + I 2 H 2 O-> 

C6H12O6 + 18 ADP + 18 Pi + 12 NADP+ + 6 H+ 

Thus, three molecules of ATP and two molecules of NADPH are consumed 
in incorporating a single CO 2 molecule into a hexose such as glucose or 
fructose. 

20.1.5 Starch and Sucrose Are the Major Carbohydrate Stores 
in Plants 

Plants contain two major storage forms of sugar: starch and sucrose. Starch, 
like its animal counterpart glycogen, is a polymer of glucose residues, but 
it is less branched than glycogen because it contains a smaller proportion of 
a- 1 , 6 -glycosidic linkages (Section 11 .2.2). Another difference is that ADP- 
glucose, not UDP-glucose, is the activated precursor. Starch is synthesized 
and stored in chloroplasts. 

In contrast, sucrose (common table sugar), a disaccharide, is synthesized 
in the cytosol. Plants lack the ability to transport hexose phosphates across 
the chloroplast membrane, but an abundant phosphate translocator medi- 
ates the transport of triose phosphates from chloroplasts to the cytosol in 
exchange for phosphate. Fructose 6 -phosphate formed from triose phos¬ 
phates joins the glucose unit of UDP-glucose to form sucrose 6 -phosphate 
(Figure 20.13). Hydrolysis of the phosphate ester yields sucrose, a readily 


Triose phosphates 
(from chloroplasts) 



Fructose 6-phosphate 



Sucrose 6-phosphate synthase 



- 1 559 1 — 

The Calvin Cycle 


FIGURE 20.13 Synthesis of sucrose. 

Sucrose 6-phosphate is formed by the 
reaction between fructose 6-phosphate 
and the activated intermediate uridine 
diphosphate glucose (UDP-glucose). 
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Pentose Phosphate Pathway 




FIGURE 20.14 Líght regulatíon of the 
Calvin cycle. The light reactions of 
photosynthesis transfer electrons out of 
the thylakoid lumen into the stroma and 
they transfer protons from the stroma into 
the thylakoid lumen. As a consequence 
of these processes, the concentrations 
of NADPH, reduced ferredoxin (Fd), and 
in the stroma are higher in the 
light than in the dark, whereas the 
concentration of is lower in the dark. 
Each of these concentration changes helps 
couple the Calvin cycle reactions to the 
light reactions. 



FIGURE 20.15 Thioredoxín. The oxidized 
form of thioredoxin contains a disulfide 
bond. When thioredoxin is reduced by 
reduced ferredoxin, the disulfide bond is 
converted into two free sulfhydryl groups. 
Reduced thioredoxin can cleave disulfide 
bonds in enzymes, activating certain 
Calvin cycle enzymes and inactivating 
some degradative enzymes. 


20.2 THE ACTIVITY OF THE CALVIN CYCLE DEPENDS 
ON ENVIRONIVIENTAL CONDITIONS 

Carbon dioxide assimilation by the Calvin cycle operates during the day, 
whereas carbohydrate degradation to yield energy takes place primarily at 
night. How are synthesis and degradation coordinately controlled? The light 
reactions lead to changes in the stroma—namely, an increase in pH and in 
Mg^^, NADPH, and reduced ferredoxin concentration—all of which con- 
tribute to the activation of certain Calvin cycle enzymes (Figure 20.14). 

20.2.1 Rubisco Is Activated by Light-Driven Changes in Proton 
and Magnesium lon Concentrations 

As stated earlier, the rate-limiting step in the Calvin cycle is the carboxy- 
lation of ribulose 1,5-bisphosphate to form two molecules of 3-phospho- 
glycerate. The activity of rubisco increases markedly on illumination. The ad- 
dition of CO 2 to lysine 201 of rubisco to form the carbamate is essential for 
Mg^^ coordination and, hence, catalytic activity (Section 20.1.1). Carbamate 
formation is favored by alkaline pH and high concentrations of Mg^^ ion 
in the stroma, both of which are consequences of the light-driven pumping 
of protons from the stroma into the thylakoid space. Magnesium ion con¬ 
centration rises because Mg^^ ions from the thylakoid space are released 
into the stroma to compensate for the influx of protons. 

20.2.2 Thioredoxin Plays a Key Role in Regulating 
the Calvin Cycle 

Light-driven reactions lead to electron transfer from water to ferredoxin 
and, eventually, to NADPH. Both reduced ferredoxin and NADPH regu- 
late enzymes from the Calvin cycle. One key protein in these regulatory 
processes is thioredoxin, a 12-kd protein containing neighboring cysteine 
residues that cycle between a reduced sulfhydryl and an oxidized disulfide 
form (Figure 20.15). The reduced form of thioredoxin activates many 
biosynthetic enzymes by reducing disulfide bridges that control their ac¬ 
tivity and inhibits several degradative enzymes by the same means (Table 
20.1). In chloroplasts, oxidized thioredoxin is reduced by ferredoxin in a re- 
action catalyzed hy ferredoxin-thioredoxin reductase. This enzyme contains 


r 


TABLE 20.1 Enzymes regulated by thioredoxin 


Enzyme 

Pathway 

Rubisco 

Carbon fixation in the Calvin cycle 

Fructose 1 , 6 -bisphosphatase 

Gluconeogenesis 

Glyceraldehyde 3-phosphate 

Calvin cycle, gluconeogenesis. 

dehydrogenase 

glycolysis 

Sedoheptulose bisphosphatase 

Calvin cycle 

Glucose 6 -phosphate dehydrogenase 

Pentose phosphate pathway 

Phenylalanine ammonia lyase 

Lignin synthesis 

Ribulose 5'-phosphate kinase 

Calvin cycle 

NADP’^-malate dehydrogenase 

C 4 pathway 
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a 4Fe-4S cluster that couples two one-electron oxidations of reduced ferre- 
doxin to the two-electron reduction of thioredoxin. Thus, the activities of 
the light and dark reactions of photosynthesis are coordinated through electron 
transfer from reduced ferredoxin to thioredoxin and then to component enzymes 
containing regulatory disulfide bonds (Figure 20.16). We shall return to thiore¬ 
doxin when we consider the reduction of ribonucleotides (Section 25.3). 

Other means of control also exist. For instance, phosphoribulose kinase 
and glyceraldehyde 3-phosphate dehydrogenase also are regulated by 
NADPH directly. In the dark, these enzymes associate with a small protein 
called CPI2 to form a large complex in which the enzymes are inactivated. 
NADPH generated in the light reactions binds to this complex, leading to 
the release of the enzymes. Thus, the activity of these enzymes depends first 
on reduction by thioredoxin and then on the NADPH-mediated release 
from CPI2. 

20.2.3 The C 4 Pathway of Tropical Plants 

Accelerates Photosynthesis by Concentrating Carbon Dioxide 

Recall that the oxygenase activity of rubisco increases more rapidly with 
temperature than does its carboxylase activity. How then do plants, such as 
sugar cane, that grow in hot climates prevent very high rates of wasteful 
photorespiration? Their solution to this problem is to achieve a high local 
concentration of CO 2 at the site of the Calvin cycle in their photosynthetic 
cells. The essence of this process, which was elucidated by M. D. Hatch and 
C. R. Slack, is that four-carbon (C4) compounds such as oxaloacetate and 
malate carry CO 2 from mesophyll cells, which are in contact with air, to 
bundle-sheath cells, which are the major sites of photosynthesis (Figure 20.17). 
Decarboxylation of the four-carbon compound in a bundle-sheath cell 
maintains a high concentration of CO 2 at the site of the Calvin cycle. The 
three-carbon compound pyruvate returns to the mesophyll cell for another 
round of carboxylation. 

The C 4 pathway for the transport of CO 2 starts in a mesophyll cell with 
the condensation of CO 2 and phosphoenolpyruvate to form oxaloacetate, in 
a reaction catalyzed hy phosphoenolpyruvate carboxylase. In some species, oxa¬ 
loacetate is converted into malate by an NADP^-linked malate dehydroge¬ 
nase. Malate goes into the bundle-sheath cell and is oxidatively decarbox- 
ylated within the chloroplasts by an NADP^-linked malate dehydrogenase. 
The released CO 2 enters the Calvin cycle in the usual way by condensing 
with ribulose 1,5-bisphosphate. Pyruvate formed in this decarboxylation 
reaction returns to the mesophyll cell. Finally, phosphoenolpyruvate is 
formed from pyruvate hy pyruvate-Pi dikinase. 




FIGURE 20.16 Enzyme activatíon by 
thioredoxin. Reduced thioredoxin 
activates certain Calvin cycle enzymes by 
cleaving regulatory disulfide bonds. 
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CO, 
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FIGURE 20.17 C 4 pathway. Carbon 
dioxide is concentrated in bundle- 
sheath cells by the expenditure of ATP 
in mesophyll cells. 
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— I 562 I - The net reaction of this C 4 pathway is 

CHAPTER 20 • The Calvin Cycle and the 

Pentose Phosphate Pathway CO 2 (m mesophyll cell) + ATP + H 2 O-> 

CO 2 (in bundle-sheath cell) + AMP + 2 Pi + 

Thus, the energetic equivalent of two ATP molecules is consumed in trans- 
porting CO 2 to the chloroplasts of the bundle-sheath cells. In essence, this 
process is active transport: the pumping of CO 2 into the bundle-sheath cell 
is driven by the hydrolysis of one molecule of ATP to one molecule of AMP 
and two molecules of orthophosphate. The CO 2 concentration can be 20 - 
fold as great in the bundle-sheath cells as in the mesophyll cells. 

When the C 4 pathway and the Calvin cycle operate together, the net re¬ 
action is 

6 CO 2 + 30 ATP + 12 NADPH + I 2 H 2 O-> 

C 6 H 12 O 6 + 30ADP + 30Pi + 12NADP+ + 18 H+ 

Note that 30 molecules of ATP are consumed per hexose molecule formed 
when the C 4 pathway delivers CO 2 to the Calvin cycle, in contrast with 18 
molecules of ATP per hexose molecule in the absence of the C 4 pathway. 
The high concentration of CO 2 in the bundle-sheath cells of C 4 plants, 
which is due to the expenditure of the additional 12 molecules of ATP, is 
criticai for their rapid photosynthetic rate, because CO 2 is limiting when 
light is abundant. A high CO 2 concentration also minimizes the energy loss 
caused by photorespiration. 

Tropical plants with a C 4 pathway do little photorespiration because the 
high concentration of CO 2 in their bundle-sheath cells accelerates the car- 
boxylase reaction relative to the oxygenase reaction. This effect is especially 
important at higher temperatures. The geographic distribution of plants 
having this pathway (C 4 plants) and those lacking it (C 3 plants) can now be 
understood in molecular terms. C 4 plants have the advantage in a hot envi- 
ronment and under high illumination, which accounts for their prevalence 
in the tropics. C 3 plants, which consume only 18 molecules of ATP per hex¬ 
ose molecule formed in the absence of photorespiration (compared with 30 
molecules of ATP for C 4 plants), are more efficient at temperatures of less 
than about 28°C, and so they predominate in temperate environments. 

Y Rubisco is found in bactéria, eukaryotes, and even archaea, though 
other photosynthetic components have not been found in archaea. 
Thus, rubisco emerged early in evolution, when the atmosphere was rich in 
CO 2 and almost devoid of O 2 . The enzyme was not originally selected to 
operate in an environment like the present one, which is almost devoid of 
CO 2 and rich in O 2 . Photorespiration became significant about 60 million 
years ago, when the CO 2 concentration fell to present leveis. The C 4 path¬ 
way is thought to have evolved in response to this selective pressure no more 
than 30 million years ago and possibly as recently as 7 million years ago. It 
is interesting to note that none of the enzymes are unique to C 4 plants, sug- 
gesting that this pathway was created using existing enzymes. 


20.2.4 Crassulacean Acid Metabolism Permits Growth 
in Arid Ecosystems 

Crassulacean acid metabolism (CAM) is yet another adaptation to increase 
the efficiency of the Calvin cycle. Crassulacean acid metabolism, named af- 
ter the genus Crassulacea (the succulents), is a response to drought as well 
as warm conditions. In CAM plants, the stomata of the leaves are closed in 
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the heat of the day to prevent water loss (Figure 20.18). As a consequence, 
CO 2 cannot be absorbed during the daylight hours when it is needed for 
glucose synthesis. When the stomata open at the cooler temperatures of 
night, CO 2 is fixed by the C 4 pathway into malate, which is stored in vac- 
uoles. During the day, malate is decarboxylated and the CO 2 becomes avail- 
able to the Calvin cycle. In contrast with C 4 plants, CO 2 accumulation is 
separated from CO 2 utilization temporally in CAM plants rather than 
spatially. 


20.3 THE PENTOSE PHOSPHATE PATHWAY GENERATES 
NADPH AND SYNTHESIZES FIVE-CARBON SUGARS 

The pentose phosphate pathway meets the need of all organisms for a source 
of NADPH to use in reductive biosynthesis (Table 20.2). This pathway con- 
sists of two phases: the oxidative generation of NADPH and the non- 
oxidative interconversion of sugars (Figure 20.19). In the oxidative phase, 
NADPH is generated when glucose 6 -phosphate is oxidized to ribose 5- 
phosphate. This five-carbon sugar and its derivatives are components of 
RNA and DNA, as well as ATP, NADH, FAD, and coenzyme A. 

Glucose 6 -phosphate + 2 NADP^ + H 2 O-> 

ribose 5-phosphate + 2 NADPH + 2 H^ + CO 2 

In the nonoxidative phase, the pathway catalyzes the interconversion of 
three-, four-, five-, six-, and seven-carbon sugars in a series of nonoxida¬ 
tive reactions that can result in the synthesis of five-carbon sugars for 
nucleotide biosynthesis or the conversion of excess five-carbon sugars into 
intermediates of the glycolytic pathway. All these reactions take place in the 
cytosol. These interconversions rely on the same reactions that lead to the 
regeneration of ribulose 1,5-bisphosphate in the Calvin cycle. 


PHASE1 
(oxidative) 


Glucose 6 -phosphate 



2 NADP+ 
2 NADPH 


Ribulose 5-phosphate 




Ribose Xylulose 

5-phosphate (C5) 5-phosphate (C5) 


r.AP (C '> Sedoheptulose 
(^ 3 ; 7-phosphate (C,) 



Fructose 

6 -phosphate (C 5 ) 


Erythose Xylulose 

4-phosphate (C4) 5-phosphate (C5) 


PHASE 2 
(nonoxidative) 



GAP (C3) 




FIGURE 20.18 Electron micrograph of 
an open stoma and a closed stoma. 

[Herb Charles Ohimeyer/Fran Heyl Associates.] 


TABLE 20.2 Pathways requiring 
NADPH 


Synthesis 

Fatty acid biosynthesis 
Cholesterol biosynthesis 
Neurotransmitter biosynthesis 
Nucleotide biosynthesis 

Detoxification 

Reduction of oxidized glutathione 
Cytochrome P450 monooxygenases 


FIGURE 20.19 Pentose phosphate 
pathway. The pathway consists of 
( 1 ) an oxidative phase that generates 
NADPH and (2) a nonoxidative phase 
that interconverts phosphorylated sugars. 
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20.3.1 Two Molecules of NADPH Are Generated in the Conver- 
sion of Glucose 6-phosphate into Ribulose 5-phosphate 

The oxidative phase of the pentose phosphate pathway starts with the de- 
hydrogenation of glucose 6-phosphate at carbon 1, a reaction catalyzed by 
glucose 6-phosphate dehydrogenase (Figure 20.20). This enzyme is highly spe- 
cific for NADP^; the Km for NAD^ is about a thousand times as great as 
that for NADP^. The product is 6 -phosphoglucono-8-lactone, which is an 
intramolecular ester between the C-1 carboxyl group and the C-5 hydroxyl 
group. The next step is the hydrolysis of 6-phosphoglucono-8-lactone by a 
specific lactonase to give 6 -phosphogluconate. This six-carbon sugar is then 
oxidatively decarboxylated by 6 -phosphogluconate dehydrogenase to yield 
ribulose 5-phosphate. NADP^ is again the electron acceptor. The final step 
in the synthesis of ribose 5-phosphate is the isomerization of ribulose 5- 
phosphate by phosphopentose isomerase (see Figure 20.11) 



Glucose 

6-phosphate 


6-Phosphoglucono- 

6-lactone 


H—C-OH 


O^^/CH^OH 

HO-C—H 

NADP+ NADPH 

H—OH +CO 2 

H—C-OH 

6-Phosphogluconate 

H—C—OH 

CH 2 OPO 32 - 

H—C-OH 

dehydrogenase 

CH 2 OPO 32 - 

6-Phospho- 


Ribulose 

gluconate 


5-phosphate 


FIGURE 20.20 Oxidative phase of the pentose phosphate pathway. Glucose 6 -phos- 
phate is oxidized to 6 -phosphoglucono- 8 -lactone to generate one molecule of NADPH. The 
lactone product is hydrolyzed to 6 -phosphogluconate, which is oxidatively decarboxylated to 
ribulose 5-phosphate with the generation of a second molecule of NADPH. 


20.3.2 The Pentose Phosphate Pathway and Glycolysis Are 
Linked by Transketolase and Transaldolase 

The preceding reactions yield two molecules of NADPH and one molecule 
of ribose 5-phosphate for each molecule of glucose 6-phosphate oxidized. 
However, many cells need NADPH for reductive biosyntheses much more 
than they need ribose 5-phosphate for incorporation into nucleotides and 
nucleic acids. In these cases, ribose 5-phosphate is converted into glycer- 
aldehyde 3-phosphate and fructose 6-phosphate by transketolase and 
transaldolase. These enzymes create a reversible link between the pentose phos¬ 
phate pathway and glycolysis by catalyzing these three successive reactions. 

Transketolase 

Cs + Cs . Cs + C7 


Transaldolase 

Cs + C7 . 


Q + C 4 


C4 + Cs . C, + Cs 

The net result of these reactions is the formation of two hexoses and one triose 
from three pentoses: 

3Cs ^ 2Ce + C3 

The first of the three reactions linking the pentose phosphate pathway 
and glycolysis is the formation of glyceraldehyde 3-phosphate and sedohep- 
tulose 7-phosphate from two pentoses. 
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0^ XH2OH 

1 

H—C-OH 


0^ XH2OH 


HO -C-H 

HO-C—H 

Transketolase 

H—C-OH 

+ H—C-OH 

V - H—C OH 

+ 

H—C-OH 

1 

H—C-OH 

H—C-OH 

CH2OPO32- 


CH2OPO32- 


H—C-OH 

CH2OPO32- 

Xylulose Ribose 

Glyceraldehyde 

CH2OPO32- 

Sedoheptulose 

S-phosphate 5-phosphate 

3-phosphate 

7-phosphate 


The donor of the two-carbon unit in this reaction is xylulose 5-phosphate, 
an epimer of ribulose 5-phosphate. A ketose is a substrate of transketolase 
only if its hydroxyl group at C-3 has the configuration of xylulose rather 
than ribulose. Ribulose 5-phosphate is converted into the appropriate 
epimer for the transketolase reaction hy phosphopentose epimer ase (see Figure 
20 .11) in the reverse reaction of that which occurs in the Calvin cycle. 

Glyceraldehyde 3-phosphate and sedoheptulose 7-phosphate generated 
by the transketolase then react to form fructose 6-phosphate and erythrose 4- 
phosphate. 


H—C-OH 

CH2OPO32- 


+ 


I XH2OH 

HO— Ç—H °^^/H 

H—C—OH HO C—H I 

I Transaldolase | ^ 

H—C OH V — H—C— OH + I 

I I H—C-OH 

H—C OH H—C—OH I 

I I CH 2 OPO 32 - 

CH2OPO32- CH2OPO32- 


Glyceraldehyde Sedoheptulose 

3-phosphate 7-phosphate 


Fructose Erythrose 

6-phosphate 4-phosphate 


This synthesis of a four-carbon sugar and a six-carbon sugar is catalyzed 
by transaldolase. 

In the third reaction, transketolase catalyzes the synthesis of fructose 6- 
phosphate and glyceraldehyde 3-phosphate from erythrose 4-phosphate and 
xylulose 5-phosphate. 





0^ XH2OH 
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0^ XH2OH 
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HO -C—H 
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:— OH 

hlO Ç H Transketolase 





+ , 

H—C—OH 

+ H—C-OH 

H—( 

: — OH 

H—C-OH 






H—C—OH 

CH20P032- 

( 

:h 20P032- 

CH2OPO32- 






CH2OPO32- 


Erythrose 

Xylulose 

Fructose 

Glyceraldehyde 

4-phosphate 

5-phosphate 

6-phosphate 

3-phosphate 


The sum of these reactions is 

2 Xylulose 5-phosphate + ribose 5-phosphate ^ 

2 fructose 6-phosphate + glyceraldehyde 3-phosphate 

Xylulose 5-phosphate can be formed from ribose 5-phosphate by the se- 
quential action of phosphopentose isomerase and phosphopentose epimerase, 
and so the net reaction starting from ribose 5-phosphate is 

3 Ribose 5-phosphate ^ 

2 fructose 6-phosphate + glyceraldehyde 3-phosphate 
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TABLE 20.3 Pentose phosphate pathway 


Reaction 


Enzyme 


Oxidative phase 

Glucose 6-phosphate + NADP"^-> 6-phosphoglucono-ô-lactone + NADPH + 

6-Phosphoglucono-ô-lactone + H 2 O-> 6-phosphogluconate + H"^ 

6-Phosphogluconate + NADP"^-> ribulose 5-phosphate + CO 2 + NADPH 


Nonoxidative Phase 

Ribulose 5-phosphate; 
Ribulose 5-phosphate; 


: ribose 5-phosphate 
: xylulose 5-phosphate 


Xylulose 5-phosphate + ribose 5-phosphate ^ 

sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate 
Sedoheptulose 7-phosphate + glyceraldehyde 3-phosphate ^ 

fructose 6-phosphate + erythrose 4-phosphate 
Xylulose 5-phosphate + erythrose 4-phosphate ^ 

fructose 6-phosphate + glyceraldehyde 3-phosphate 


Glucose 6-phosphate dehydrogenase 
Lactonase 

6-Phosphogluconate dehydrogenase 


Phosphopentose isomerase 
Phosphopentose epimerase 
Transketolase 

Transaldolase 

Transketolase 


Thus, excess ribose 5-phosphate formed by the pentose phosphate pathway can 
be completely converted into glycolytic intermediates. Moreover, any ribose 
ingested in the diet can be processed into glycolytic intermediates by this 
pathway. It is evident that the carbon skeletons of sugars can be extensively 
rearranged to meet physiologic needs (Table 20.3). 


FIGURE 20.21 Transketolase 
mechanísm. The carbanion of thiamine 
pyrophosphate (TPP) attacks the ketose 
substrate. Cleavage of a carbon-carbon 
bond frees the aldose product and leaves 
a two-carbon fragment joined to TPP. 

This activated glycoaldehyde intermediate 
attacks the aldose substrate to form a new 
carbon-carbon bond. The ketose product 
is released, freeing the TPP for the next 
reaction cycle. 


20.3.3 Transketolase and Transaldolase Stabilize Carbanionic 
Intermediates by Different Mechanisms 

The reactions catalyzed by transketolase and transaldolase are distinct yet 
similar in many ways. One difference is that transketolase transfers a two- 
carbon unit, whereas transaldolase transfers a three-carbon unit. Each of 
these units is transiently attached to the enzyme in the course of the reac¬ 
tion. In transketolase, the site of addition of the unit is the thiazole ring of 
the required coenzyme thiamine pyrophosphate. Transketolase is homolo- 
gous to the El subunit of the pyruvate dehydrogenase complex (Section 
17.1.1) and the reaction mechanism is similar (Figure 20.21). 
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The C-2 carbon atom of bound TPP readily ionizes to give a carbanion. I 567 | 

The negatively charged carbon atom of this reactive intermediate attacks The Pentose Phosphate Pathway 

the carbonyl group of the ketose substrate. The resulting addition com- 
pound releases the aldose product to yield an activated glycoaldehyde unit. 

The positively charged nitrogen atom in the thiazole ring acts as an electron 
sink in the development of this activated intermediate. The carbonyl group 
of a suitable aldose acceptor then condenses with the activated glycoalde¬ 
hyde unit to form a new ketose, which is released from the enzyme. 

Transaldolase transfers a three-carbon dihydroxyacetone unit from a 
ketose donor to an aldose acceptor. Transaldolase, in contrast with trans- 
ketolase, does not contain a prosthetic group. Rather, a Schiffbase isformed 
between the carbonyl group of the ketose substrate and the e-amino group of 
a lysine residue at the active site of the enzyme (Figure 20.22). This kind of 
covalent enzyme-substrate intermediate is like that formed in fructose 
1 ,6-bisphosphate aldolase in the glycolytic pathway (Section 16.1.3) and, 
indeed, the enzymes are homologous. The Schiffbase becomes protonated, 
the bond between C-3 and C-4 is split, and an aldose is released. The neg- 
ative charge on the Schiff-base carbanion moiety is stabilized by resonance. 

The positively charged nitrogen atom of the protonated Schiff base acts 
as an electron sink. The Schiff-base adduct is stable until a suitable aldose 
becomes bound. The dihydroxyacetone moiety then reacts with the car¬ 
bonyl group of the aldose. The ketose product is released by hydrolysis 
of the Schiff base. The nitrogen atom of the protonated Schiff base plays 
the same role in transaldolase as the thiazole-ring nitrogen atom does in 
transketolase. In each enzyme, a group within an intermediate reacts like 


/NHp 


Lysine 



Ketose 

substrate 



Schiff base 


/NH2 




Ketose 

product 



CH2OH 


Protonated 
Schiff base 


Aldose product / 




o 

c 








CH2OH 


l_l 

Aldose substrate / 



FIGURE 20.22 Transaldolase mechanism. The reaction begins with the formation of a 
Schiff base between a lysine residue in transaldolase and the ketose substrate. Protonation 
of the Schiff base leads to release of the aldose product, leaving a three-carbon fragment 
attached to the lysine residue. This intermediate adds to the aldose substrate to form a new 
carbon-carbon bond. The reaction cycle is completed by release of the ketose product from 
the lysine side chain. 
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FIGURE 20.23 Carbaníon íntermedíates. 

For transketolase and transaldolase, a 
carbanion intermediate is stabilized by 
resonance. In transketolase, TPP stabilizes 
this intermediate; in transaldolase, a 
protonated Schiff base plays this role. 



CH 2 OH 
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a carbanion in attacking a carbonyl group to form a new carbon-carbon 
bond. In each case, the charge on the carbanion is stabilized by resonance 
(Figure 20.23). 



CONCEPTUAL INSIGHTS, 


OverView of Carbohydrate and Fatty 
Acíd Metabolism. View this media 
module to gain a "bigger picture" under- 
standing of the roles of the pentose 
phosphate pathway in the context of 
other metabolic pathways (glycolysis, 
citric add cyde, glycogen and fatty 
add metabolism). 


20.4 THE METABOLISIVI OF GLUCOSE 6-PHOSPHATE 
BY THE PENTOSE PHOSPHATE PATHWAY IS 
COORDINATED WITH GLYCOLYSIS 

Glucose 6-phosphate is metabolized by both the glycolytic pathway (Chap- 
ter 16) and the pentose phosphate pathway. How is the processing of this 
important metabolite partitioned between these two metabolic routes? The 
cytoplasmic concentration of NADP^ plays a key role in determining the 
fate of glucose 6-phosphate. 

20.4.1 The Rate of the Pentose Phosphate Pathway Is 
Controlled by the Levei of NADP^ 

The first reaction in the oxidative branch of the pentose phosphate path¬ 
way, the dehydrogenation of glucose 6-phosphate, is essentially irreversible. 
In fact, this reaction is rate limiting under physiological conditions and 
serves as the control site. The most important regulatory factor is the levei 
of NADP^, the electron acceptor in the oxidation of glucose 6-phosphate 
to 6-phosphoglucono-8-lactone. The inhibitory effect of low leveis of 
NADP^ is exacerbated by the fact that NADPH competes with NADP^ 
in binding to the enzyme. The ratio of NADP^ to NADPH in the cytosol 
of a li ver cell from a well-fed rat is about 0.014, several orders of magnitude 
lower than the ratio of NAD^ to NADH, which is 700 under the same con¬ 
ditions. The marked effect of the NADP^ levei on the rate of the oxidative 
phase ensures that NADPH generation is tightly coupled to its utilization 
in reductive biosyntheses. The nonoxidative phase of the pentose phosphate 
pathway is controlled primarily by the availability of substrates. 


20.4.2 The Flow of Glucose 6-phosphate Depends on the Need 
for NADPH, Ribose 5-phosphate, and ATP 

We can grasp the intricate interplay between glycolysis and the pentose 
phosphate pathway by examining the metabolism of glucose 6-phosphate 
in four different metabolic situations (Figure 20.24). 

Mode 1. Much more ribose 5-phosphate than NADPH is required. For ex- 
ample, rapidly dividing cells need ribose 5-phosphate for the synthesis of 
nucleotide precursors of DNA. Most of the glucose 6-phosphate is converted 
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FIGURE 20.24 Four modes of the pentose phosphate pathway. Major products are 
shown in color. 


[ Pyruvate ] 


2 ATP 


into fructose 6-phosphate and glyceraldehyde 3-phosphate by the glycolytic 
pathway. Transaldolase and transketolase then convert two molecules of 
fructose 6-phosphate and one molecule of glyceraldehyde 3-phosphate 
into three molecules of ribose 5-phosphate by a reversal of the reactions 
described earlier. The stoichiometry of mode 1 is 

5 Glucose 6-phosphate + ATP-> 6 ribose 5-phosphate + ADP + 

Mode 2. The needs for NADPH and ribose 5-phosphate are balanced. The 
predominant reaction under these conditions is the formation of two mole¬ 
cules of NADPH and one molecule of ribose 5-phosphate from one mole¬ 
cule of glucose 6-phosphate in the oxidative phase of the pentose phosphate 
pathway. The stoichiometry of mode 2 is 

Glucose 6-phosphate + 2 NADP^ + H 2 O-> 

ribose 5-phosphate + 2 NADPH + 2 H^ + CO 2 

Mode 3. Much more NADPH than ribose 5-phosphate is required. For ex- 
ample, adipose tissue requires a high levei of NADPH for the synthesis of 
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TABLE 20.4 Tissues with active pentose phosphate pathways 


Tissue 

Function 

Adrenal gland 

Steroid synthesis 

Liver 

Fatty acid and cholesterol synthesis 

Testes 

Steroid synthesis 

Adipose tissue 

Fatty acid synthesis 

Ovary 

Steroid synthesis 

Mammary gland 

Fatty acid synthesis 

Red blood cells 

Maintenance of reduced glutathione 


fatty acids (Table 20.4). In this case, glucose 6-phosphate is completely ox- 
idized to CO 2 . Three groups of reactions are active in this situation. First, 
the oxidative phase of the pentose phosphate pathway forms two molecules 
of NADPH and one molecule of ribose 5-phosphate. Then, ribose 5-phos¬ 
phate is converted into fructose 6-phosphate and glyceraldehyde 3-phos¬ 
phate by transketolase and transaldolase. Finally, glucose 6-phosphate is 
resynthesized from fructose 6-phosphate and glyceraldehyde 3-phosphate 
by the gluconeogenic pathway. The stoichiometries of these three sets of 
reactions are 

6 Glucose 6 -phosphate + 12 NADP^ + 6 H 2 O-> 

6 ribose 5-phosphate + 12 NADPH + 12 + 6 CO 2 

6 Ribose 5-phosphate-> 

4 fructose 6-phosphate + 2 glyceraldehyde 3-phosphate 

4 Fructose 6-phosphate + 2 glyceraldehyde 3-phosphate + H 2 O-> 

5 glucose 6-phosphate + P^ 

The sum of the mode 3 reactions is 

Glucose 6-phosphate + 12NADP^ + 7 H 2 O-> 

6 CO 2 + 12 NADPH + 12 H+ + Pi 

Thus, the equivalent of glucose 6-phosphate can be completely oxidized to CO 2 
with the concomitant generation of NADPH. In essence, ribose 5-phosphate 
produced by the pentose phosphate pathway is recycled into glucose 
6-phosphate by transketolase, transaldolase, and some of the enzymes of the 
gluconeogenic pathway. 

Mode 4. Both NADPH and ATP are required. Alternatively, ribose 5- 
phosphate formed by the oxidative phase of the pentose phosphate path¬ 
way can be converted into pyruvate. Fructose 6-phosphate and glyceralde¬ 
hyde 3-phosphate derived from ribose 5-phosphate enter the glycolytic 
pathway rather than reverting to glucose 6-phosphate. In this mode, ATP 
and NADPH are concomitantly generated, andfive of the six carbons of glu¬ 
cose 6-phosphate emerge in pyruvate. 

3 Glucose 6-phosphate + 6NADP^ + 5NAD^ + 5 Pi + 8ADP-> 

5 pyruvate + 3 CO 2 + 6 NADPH + 5 NADH 

+ 8ATP + 2 H 2 O + 8H+ 

Pyruvate formed by these reactions can be oxidized to generate more ATP 
or it can be used as a building block in a variety of biosyntheses. 
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20.4.3 Through the Looking Glass: The Calvin Cycle and the 
Pentose Phosphate Pathway 

The complexities of the Calvin cycle and the pentose phosphate pathway 
are easier to comprehend if we consider them mirror images of each other. 
The Calvin cycle begins with the fixation of CO 2 and goes on to use 
NADPH in the synthesis of glucose. The pentose phosphate pathway be¬ 
gins with the oxidation of a glucose-derived carbon atom to CO 2 and con- 
comitantly generates NADPH. The regeneration phase of the Calvin cycle 
converts Ce and C 3 molecules back into the starting material—the C 5 mol- 
ecule ribulose 1,5-bisphosphate. The pentose phosphate pathway converts 
a C 5 molecule, ribose 5-phosphate, into Ce and C 3 intermediates of the gly- 
colytic pathway. Not surprisingly, in photosynthetic organisms, many en- 
zymes are common to the two pathways. We see the economy of evolution: 
the use of identical enzymes for similar reactions with different ends. 


20.5 GLUCOSE 6-PHOSPHATE DEHYDROGENASE PLAYS 
A KEY ROLE IN PROTECTION AGAINST REACTIVE 
OXYGEN SPECIES 



Electron micrograph of a chioroplast. 

The thylakoid membranes course through- 
out the stroma of a chioroplast from a cell 
of Phleum pratense, a grass. The dark 
areas of stacked thylakoid membrane are 
grana. Several large starch granules, which 
store the newly synthesized glucose, are 
aiso obvious. [Biophoto Associates/Photo 
Researchers.] 


Reactive oxygen species (ROS) generated in oxidative metabolism inflict 
damage on all classes of macromolecules and can ultimately lead to cell 
death. Indeed, ROS are implicated in a number of human diseases (Section 
18.3.6). Reduced glutathione (GSH), a tripeptide with a free sulfhydryl 
group, is required to combat oxidative stress and maintain the normal re¬ 
duced State in the cell. Oxidized glutathione (GSSG) is reduced by NADPH 
generated by glucose 6 -phosphate dehydrogenase in the pentose phosphate 
pathway. Indeed, cells with reduced leveis of glucose 6 -phosphate dehy¬ 
drogenase are especially sensitive to oxidative stress. This stress is most 
acute in red blood cells because, lacking mitochondria, they have no alter¬ 
nativo means of generating reducing power. 

20.5.1 Glucose 6-phosphate Dehydrogenase Deficiency Causes 
a Drug-Induced Hemolytic Anemia 

7 At its introduction in 1926, an antimalarial drug, pamaquine, was 
associated with the appearance of severe and mysterious ailments. 
Most patients tolerated the drug well, but a few developed severe symp- 
toms within a few days after therapy was started. The urine turned black, 
jaundice developed, and the hemoglobin content of the blood dropped 
sharply. In some cases, massive destruction of red blood cells caused death. 

This drug-induced hemolytic anemia was shown 30 years later to be 
caused by a deficiency of glucose 6-phosphate dehydrogenase, the enzyme cat- 
alyzing the first step in the oxidative branch of the pentose phosphate path¬ 
way. This defect, which is inherited on the X chromosome, is the most com¬ 
mon enzymopathy, affecting hundreds of millions of people. The major role 
of NADPH in red cells is to reduce the disulfide form of glutathione to the 
sulfhydryl form. The enzyme that catalyzes the regeneration of reduced glu¬ 
tathione, the flavoprotein glutathione reductase, a dimer of 50-kd subunits, 
is homologous to ferredoxin-NADP^ reductase, which we encountered in 
photosynthesis (Section 19.3.4). The reduced form of glutathione serves as a 
sulfhydryl buffer that maintains the cysteine residues of hemoglobin and other 
red-blood-cell proteins in the reduced State. Normally, the ratio of the reduced 
to oxidized forms of glutathione in red blood cells is 500. 

How is GSH regenerated from GSSG and NADPH by glutathione re¬ 
ductase? The electrons from NADPH are not directly transferred to the 



Glutathione (reduced) 
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Vida faba. The Mediterranean plant Vicia 
faba is a source of fava beans that contain 
the purine glycoside vicine. [Inga Spence/ 
Visuais Uniimited.] 



FIGURE 20.25 Red blood cells wíth 
Heinz bodíes. The light micrograph 
shows red blood cells obtained from a 
person deficient in glucose 6-phosphate 
dehydrogenase. The dark particles, called 
Heinz bodies, inside the cells are clumps 
of denatured protein that adhere to the 
plasma membrane and stain with basic 
dyes. Red blood cells in such people are 
highly susceptible to oxidative damage. 
[Courtesy of Dr. Stanley Schrier.] 


disulfide bond in oxidized glutathione. Rather, they are transferred from 
NADPH to a tightly bound flavin adenine dinucleotide (FAD) on the re- 
ductase, then to a disulfide bridge between two cysteine residues in the en- 
zyme subunit, and finally to oxidized glutathione. 

Reduced glutathione is essential for maintaining the normal structure of 
red blood cells and for keeping hemoglobin in the ferrous State. The reduced 
form also plays a role in detoxification by reacting with hydrogen peroxide 
and organic per oxides. 

2 GSH + ROOH -> GSSG + H 2 O + ROH 

Cells with a lowered levei of reduced glutathione are more susceptible to 
hemolysis. How can we explain this phenomenon biochemically? The pres- 
ence of pamaquine, a purine glycoside of fava beans, or other nonenzymatic 
oxidative agents leads to the generation of per oxides, reactive oxygen species 
that can damage membranes as well as other biomolecules. Peroxides are 
normally eliminated by glutathione peroxidase with the use of glutathione 
as a reducing agent (Section 24.4). Moreover, in the absence of the enzyme, 
the hemoglobin sulfhydryl groups can no longer be maintained in the re¬ 
duced form and hemoglobin molecules then cross-link with one another to 
form aggregates called Heinz bodies on cell membranes (Figure 20.25). 
Membranes damaged by the Heinz bodies and reactive oxygen species be- 
come deformed and the cell is likely to undergo lysis. In the absence of ox¬ 
idative stress, however, the deficiency is quite benign. The occurrence of 
this dehydrogenase deficiency also clearly demonstrates that atypical reac- 
tions to drugs may have a genetic basis. 

20.5.2 A Deficiency of Glucose 6-phosphate Dehydrogenase 
Confers an Evolutionary Advantage in Some Circumstances 

The incidence of the most common form of glucose 6-phosphate 

dehydrogenase deficiency, characterized by a tenfold reduction in en- 
zymatic activity in red blood cells, is 11% among Americans of African her- 
itage. This high frequency suggests that the deficiency may be advantageous 
under certain environmental conditions. Indeed, glucose 6-phosphate dehy¬ 
drogenase deficiency protects against falciparum malaria. The parasites caus- 
ing this disease require reduced glutathione and the products of the pentose 
phosphate pathway for optimal growth. Thus, glucose 6-phosphate dehy¬ 
drogenase deficiency is a mechanism of protection against malaria, which 
accounts for its high frequency in malaria-infested regions of the world. We 
see here once again the interplay of heredity and environment in the pro- 
duction of disease. 


SUMMARY 


The Calvin Cycle Synthesizes Hexoses from Carbon Dioxide and Water 

ATP and NADPH formed in the light reactions of photosynthesis are 
used to convert CO 2 into hexoses and other organic compounds. The 
dark phase of photosynthesis, called the Calvin cycle, starts with the re- 
action of CO 2 and ribulose 1,5-bisphosphate to form two molecules of 
3-phosphoglycerate. The steps in the conversion of 3-phosphoglycer- 
ate into fructose 6-phosphate and glucose 6-phosphate are like those of 
gluconeogenesis, except that glyceraldehyde 3-phosphate dehydroge¬ 
nase in chloroplasts is specific for NADPH rather than NADH. Ribu¬ 
lose 1,5-bisphosphate is regenerated from fructose 6-phosphate, glycer¬ 
aldehyde 3-phosphate, and dihydroxyacetone phosphate by a complex 
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series of reactions. Several of the steps in the regeneration of ribulose 
1,5-bisphosphate are like those of the pentose phosphate pathway. 
Three molecules of ATP and two molecules of NADPH are consumed 
for each molecule of CO 2 converted into a hexose. Starch in chloroplasts 
and sucrose in the cytosol are the major carbohydrate Stores in plants. 

• The Activity of the Calvin Cycle Depends on Environmental Conditions 

Reduced thioredoxin formed by the light-driven transfer of electrons 
from ferredoxin activates enzymes of the Calvin cycle by reducing disul- 
fide bridges. The light-induced increase in pH and Mg^^ levei of the 
stroma is important in stimulating the carboxylation of ribulose 1 ,5 -bis- 
phosphate by ribulose 1,5-bisphosphate carboxylase. This enzyme also 
catalyzes a competing oxygenase reaction, which produces phospho- 
glycolate and 3-phosphoglycerate. The recycling of phosphoglycolate 
leads to the release of CO 2 and further consumption of O 2 in a process 
called photorespiration. This wasteful side reaction is minimized in 
tropical plants, which have an accessory pathway—called the C 4 path¬ 
way—for concentrating CO 2 at the site of the Calvin cycle. This path¬ 
way enables tropical plants to take advantage of high leveis of light and 
minimize the oxygenation of ribulose 1,5-bisphosphate. Plants in arid 
ecosystems employ Crassulacean acid metabolism (CAM) to prevent de- 
hydration. In CAM plants, the C 4 pathway is active during the night 
when the plant exchanges gases with the air. During the day, gas exchange 
is eliminated and CO 2 is generated from malate stored in vacuoles. 

• The Pentose Phosphate Pathway Generates NADPH and Synthesizes 
Five-Carbon Sugars 

Whereas the Calvin cycle is present only in photosynthetic organisms, 
the pentose phosphate pathway is present in all organisms. The pentose 
phosphate pathway generates NADPH and ribose 5-phosphate in the 
cytosol. NADPH is used in reductive biosyntheses, whereas ribose 5- 
phosphate is used in the synthesis of RNA, DNA, and nucleotide coen- 
zymes. The pentose phosphate pathway starts with the dehydrogena- 
tion of glucose 6 -phosphate to form a lactone, which is hydrolyzed to 
give 6 -phosphogluconate and then oxidatively decarboxylated to yield 
ribulose 5-phosphate. NADP^ is the electron acceptor in both of these 
oxidations. The last step is the isomerization of ribulose 5-phosphate (a 
ketose) to ribose 5-phosphate (an aldose). A different mode of the path¬ 
way is active when cells need much more NADPH than ribose 5- 
phosphate. Under these conditions, ribose 5-phosphate is converted into 
glyceraldehyde 3-phosphate and fructose 6 -phosphate by transketolase 
and transaldolase. These two enzymes create a reversible link between 
the pentose phosphate pathway and glycolysis. Xylulose 5-phosphate, 
sedoheptulose 7-phosphate, and erythrose 4-phosphate are intermedi- 
ates in these interconversions. In this way, 12 molecules of NADPH can 
be generated for each molecule of glucose 6 -phosphate that is com- 
pletely oxidized to CO 2 . 

• The Metabolism of Glucose 6-phosphate by the Pentose Phosphate 
Pathway Is Coordinated with Glycolysis 

Only the nonoxidative branch of the pathway is significantly active 
when much more ribose 5-phosphate than NADPH needs to be syn- 
thesized. Under these conditions, fructose 6 -phosphate and glyceralde¬ 
hyde 3-phosphate (formed by the glycolytic pathway) are converted into 
ribose 5-phosphate without the formation of NADPH. Alternatively, 
ribose 5-phosphate formed by the oxidative branch can be converted 
into pyruvate through fructose 6 -phosphate and glyceraldehyde 
3-phosphate. In this mode, ATP and NADPH are generated, and five 
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of the six carbons of glucose 6-phosphate emerge in pyruvate. The in- 
terplay of the glycolytic and pentose phosphate pathways enables the 
leveis of NADPH, ATP, and building blocks such as ribose 5-phos¬ 
phate and pyruvate to be continuously adjusted to meet cellular needs. 

• Glucose 6-phosphate Dehydrogenase Plays a Key Role in Protection 
Against Reactive Oxygen Species 

NADPH generated by glucose 6-phosphate dehydrogenase maintains 
the appropriate leveis of reduced glutathione required to combat ox- 
idative stress and maintain the proper reducing environment in the cell. 
Cells with diminished glucose 6-phosphate dehydrogenase activity are 
especially sensitive to oxidative stress. 


-1 KEY TERMS |- 

Calvin cycle (dark reactions) (p. 552) 
autotroph (p. 552) 
heterotroph (p. 552) 
rubisco (ribulose 1,5-bisphosphate 
carboxylase/oxygenase) (p. 553) 
peroxisome (p. 555) 
photorespiration (p. 556) 
starch (p. 559) 


sucrose (p. 559) 
thioredoxin (p. 560) 

G4 pathway (p. 561) 

G4 plant (p. 562) 

G3 plant (p. 562) 

Crassulacean acid metabolism (CAM) 
(p. 562) 

pentose phosphate pathway (p. 563) 


glucose 6-phosphate dehydrogenase 
(p. 564) 

transketolase (p. 564) 
transaldolase (p. 564) 
glutathione (p. 571) 
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-1 PROBLEMS I- 

1 . Variation on a theme. Sedoheptulose 1,7-bisphosphate is 
an intermediate in the Calvin cycle but not in the pentose 
phosphate pathway. What is the enzymatic basis of this 
difference? 

2. Total eclipse. An illuminated suspension of Chlorella is ac- 
tively carrying out photosynthesis. Suppose that the light is sud- 
denly switched off. How would the leveis of 3-phosphoglycerate 
and ribulose 1,5-bisphosphate change in the next minute? 

3. CO 2 deprivation. An illuminated suspension of Chlorella is 
actively carrying out photosynthesis in the presence of 1 % CO 2 . 
The concentration of CO 2 is abruptly reduced to 0.003%. What 
effect would this reduction have on the leveis of 3-phospho- 
glycerate and ribulose 1,5-bisphosphate in the next minute? 

4. A potent analog. 2 -Carboxyarabinitol 1,5-bisphosphate 
(CABP) has been useful in studies of rubisco. 

(a) Write the structural formula of CABP. 

(b) Which catalytic intermediate does it resemble? 

(c) Predict the effect of CABP on rubisco. 

5. Salvage operation. Write a balanced equation for the 
transamination of glyoxylate to yield glycine. 

6 . When one equals two. In the C 4 pathway, one ATP molecule 
is used in combining the CO 2 with phosphoenolpyruvate to form 
oxaloacetate (Figure 20.17), but, in the computation of energet- 
ies bookkeeping, two ATP molecules are said to be consumed. 
Explain. 


7. Dog days of August. Before the days of pampered lawns, most 
homeowners practiced horticultural Darwinism. A result was that 
the lush lawns of early summer would often convert into robust 
cultures of crabgrass in the dog days of August. Provide a pos- 
sible biochemical explanation for this transition. 

8 . Global warming. C 3 plants are most common in higher lati¬ 
tudes and become less common at latitudes near the equator. The 
reverse is true of C 4 plants. How might global warming affect 
this distribution? 

9. Tracing glucose. Glucose labeled with ^"'^C at C -6 is added to 
a solution containing the enzymes and cofactors of the oxidative 
phase of the pentose phosphate pathway. What is the fate of the 
radioactive labei? 

10 . Recurring decarboxylations. Which reaction in the citric acid 
cycle is most analogous to the oxidative decarboxylation of 6 - 
phosphogluconate to ribulose 5-phosphate? What kind of en- 
zyme-bound intermediate is formed in both reactions? 

11 . Carbon shuffling. Ribose 5-phosphate labeled with ^^C at 
C -1 is added to a solution containing transketolase, transaldol- 
ase, phosphopentose epimerase, phosphopentose isomerase, 
and glyceraldehyde 3-phosphate. What is the distribution of the 
radioactive labei in the erythrose 4-phosphate and fruetose 
6 -phosphate that are formed in this reaction mixture? 

12. Synthetic stoichiometries. What is the stoichiometry of syn- 
thesis of (a) ribose 5-phosphate from glucose 6 -phosphate with- 
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out the concomitant generation of NADPH; (b) NADPH from 
glucose 6 -phosphate without the concomitant formation of pen¬ 
tose sugars? 

13. Trapping a reactive lysine. Design a chemical experiment to 
identify the lysine residue that forms a Schiff base at the active 
site of transaldolase. 

14. Reductive power. What ratio of NADPH to NADP^ is re- 
quired to sustain [GSH] = 10 mM and [GSSG] = 1 mM? Use 
the redox potentials given in Table 18.1. 

Mechanism Problems 

15. An alternative approach. The mechanisms of some aldolases 
do not include Schiff-base intermediates. Instead, these enzymes 
require bound metal ions. Propose such a mechanism for the 
conversion of dihydroxyacetone phosphate and glyceraldehyde 
3-phosphate into fructose 1,6-bisphosphate. 

16. A recurring intermediate. Phosphopentose isomerase inter- 
converts the aldose ribose 5-phosphate and the ketose ribulose 
5-phosphate. Propose a mechanism. 

Chapter Integration Problems 

17. Catching carbons. Radioactive-labeling experiments can 
yield estimates of how much glucose 6 -phosphate is metabolized 
by the pentose phosphate pathway and how much is metabo¬ 
lized by the combined action of glycolysis and the citric acid cy¬ 
cle. Suppose that you have samples of two different tissues as 
well as two radioactively labeled glucose samples, one with glu¬ 
cose labeled with at C -1 and the other with glucose labeled 
with at C- 6 . Design an experiment that would enable you 
to determine the relative activity of the aerobic metabolism of 
glucose compared with metabolism by the pentose phosphate 
pathway. 

18. Photosynthetic efficiency. Use the following information to 
estimate the efficiency of photosynthesis. 

The AG°' for the reduction of CO 2 to the levei of hexose is 
+ 114 kcal moU^ ( + 477 kj moU^). 

A mole of 600-nm photons has an energv content of 47.6 kcal 
(199 kJ). 

Assume that the proton gradient generated in producing the re- 
quired NADPH is sufficient to drive the synthesis of the re- 
quired ATP. 


Date Interpretation Problem 

19. Graph A shows the photosynthetic activity of two species 
of plant, one a C 4 plant and the other a C 3 plant, as a function 
of leaf temper ature. 



(a) Which data were most likely generated by the C 4 plant and 
which by the C 3 plant? Explain. 

(b) Suggest some possible explanations for why the photosyn¬ 
thetic activity falis at higher temper atures. 

Graph B illustrates how the photosynthetic activity of C 3 
and G 4 plants varies with CO 2 concentration when temperature 
(30°C) and light intensity (high) are constant. 



(c) Why can C 4 plants thrive at CO 2 concentrations that do not 
support the growth of C 3 plants? 

(d) Suggest a plausible explanation for why C 3 plants continue 
to increase photosynthetic activity at higher CO 2 concentrations, 
whereas C 4 plants reach a plateau. 
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LBLE 22.1 

Principal reactions in fatty acid oxidation 


Step 

Reaction 

Enzyme 

1 

Fatty acid + CoA + ATP ,—^ 

acyl CoA + AMP + PP, 

Acyl CoA synthetase [also called fatty acid thiokinase 
and fatty acid:CoA ligase (AMP)] 

2 

Garnitine + acyl CoA ^ acyl carnitine + CoA 

Carnitine acyltransferase (also called carnitine 
palmitoyl transferase) 

3 

Acyl CoA + E-FAD ^ 

trans-A^-enoyl CoA + E-FADH 2 

Acyl CoA dehydrogenases (several isozymes having 
different chain-length specificity) 

4 

írans-A^-Enoyl CoA + H7O ^ ^ 

L-3-hydroxyacyl CoA 

Enoyl CoA hydratase (also called crotonase or 
3-hydroxyacyl CoA hydrolyase) 

5 

L-3-Hydroxyacyl CoA + NAD''' ^ 

3 Aetoacyl CoA + NADH + 

L-3-Hydroxyacyl CoA dehydrogenase 

6 

3-Ketoacyl CoA + CoA y 

acetyl CoA + acyl CoA (shortened by G 2 ) 

(3-Ketothiolase (also called thiolase) 
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TABLE 22.2 Principal reactions in fatty acid synthesis in bactéria 


Step 

Reaction 

Enzyme 

1 

Acetyl CoA + HCO^^ + ATP —^ 

malonyl CoA + ADP + P; + 

Acetyl CoA carboxylase 

2 

Acetyl CoA + ACP v acetyl ACP + CoA 

Acetyl transacylase 

3 

Malonyl CoA + ACP v malonyl ACP + CoA 

Malonyl transacylase 

4 

Acetyl ACP + malonyl ACP —> 

acetoacetyl ACP + ACP + CO 2 

Acyl-malonyl ACP condensing enzyme 

5 

Acetoacetyl ACP + NADPH + H" 

D-3-hydroxybutyryl ACP + NADP^ 

3-Ketoacyl ACP reductase 

6 

D-3-Hydroxybutyryl ACP v 

crotonyl ACP + H 2 O 

3-Hydroxyacyl ACP dehydratase 

7 

Crotonyl ACP + NADPH + H" ^ 

butyryl ACP + NADP" 

Enoyl ACP reductase 
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Sensory Systems 



Color perception. The photoreceptor rhodopsin (right), which absorbs 
light in the process of vision, consists of the protein opsin and a bound 
vitamin A derivative, retinal. The amino adds (shown in red) that surround 
the retinal determine the color of light that is most efficientiy absorbed. 
Individuais lacking a light-absorbing photoreceptor for the color green wili 
see a colorfui fruit stand (left) as mostiy yellows (middie). 

[Photographs from L. T. Sharpe, A. Stockman, H. Jagle, and J. Nathans. (1999) -Opsin 
genes, cone photopigments, color vision, and color blindness, in Color Vision: from 
Genes to Perception, pp. 3-51. K. Gegenfurtner, L. T. Sharpe, eds. Cambridge Univer- 
sity Press.] 


OUTLINE 


Our senses provide us with means for detecting a diverse set of externai sig- 
nals, often with incredible sensitivity and specificity. For example, when fully 
adapted to a darkened room, our eyes allow us to sense very low leveis of light, 
down to a limit of less than ten photons. With more light, we are able to distin- 
guish millions of colors. Through our senses of smell and taste, we are able to 
detect thousands of Chemicals in our environment and sort them into categories: 
pleasant or unpleasant? healthful or toxic? Finally, we can perceive mechanical 
stimuli in the air and around us through our senses of hearing and touch. 

How do our sensory systems work? How are the initial stimuli detected? 
How are these initial biochemical events transformed into perceptions and 

experiences? We have previously encountered systems that _ 

sense and respond to chemical signals—namely, receptors 
that bind to growth factors and hormones. Our knowledge 
of these receptors and their associated signal-transduction 
pathways provides us with concepts and tools for unravel- 
ing some of the workings of sensory systems. For example, 

7 TM receptors (seven-transmembrane receptors, Section 
IS.x.x) play key roles in olfaction, taste, and vision. lon 
channels that are sensitive to mechanical stress are essen- 
tial for hearing and touch. 

In this chapter, we shall focus on the five major sensory 
systems found in human beings and other mammals: ol¬ 
faction (the sense of smell; i.e., the detection of small mol- 
ecules in the air), taste or gustation (the detection of selected 
organic compounds and ions by the tongue), vision (the 
detection of light), hearing (the detection of sound, or 


32.1 A Wide Variety of Organic 
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32.2 Taste Is a Combination of Senses 
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FIGURE 32.1 Sensory connections to 
the brain. Sensory nen/es connect sensory 
organs to the brain and spinal cord. 



pressure waves in the air), and touch (the detection of changes in pressure, 
temperature, and other factors by the skin). Each of these primary sensory 
Systems contains specialized sensory neurons that transmit nerve impulses 
to the central nervous system (Figure 32.1). In the central nervous system, 
these signals are processed and combined with other Information to yield a 
perception that may trigger a change in behavior. By these means, our senses 
allow us to detect changes in our environments and to adjust our behavior 
appropriately. 


32.1 A WIDE VARIETY OF ORGANIC COMPOUNDS ARE 
DETECTED BY OLFACTION 


Human beings can detect and distinguish thousands of different compounds 
by smell, often with considerable sensitivity and specificity. Most odorants 
are relatively small organic compounds with sufficient volatility that they 
can be carried as vapors into the nose. For example, a major component re- 
sponsible for the smell of almonds is the simple aromatic compound ben- 
zaldehyde, whereas the sulfhydryl compound 3-methylbutane-l-thiol is a 
major component of the smell of skunks. 



Benzaldehyde 

(Almond) 


3-IVIethylbutane-1 -thiol 
(Skunk) 


Geraniol 

(Rose) 


Zingiberene 

(Ginger) 





















What properties of these molecules are responsible for their smells? 
First, the shape of the molecule rather than its other physical properties is cru¬ 
cial. We can most clearly see the importance of shape by comparing mole¬ 
cules such as those responsible for the smells of spearmint and caraway. 
These compounds are identical in essentially all physical properties such as 
hydrophobicity because they are exact mirror images of one another. Thus, 
the smell produced by an odorant depends not on a physical property but 
on the compound^s interaction with a specific binding surface, most likely 
a protein receptor. Second, some human beings (and other animais) suffer 
from specific anosmias; that is, they are incapable of smelling specific com¬ 
pounds even though their olfactory systems are otherwise normal. Such 
anosmias are often inherited. These observations suggest that mutations in 
individual receptor genes lead to the loss of the ability to detect a small sub- 
set of compounds. 

32.1.1 Olfaction Is Mediated by an Enormous Family of 
Seven-Transmembrane-Helix Receptors 

Odorants are detected in a specific region of the nose, called the main ol¬ 
factory epithelium, that lies at the top of the nasal cavity (Figure 32.2). Ap- 
proximately 1 million sensory neurons line the surface of this region. Cilia 
containing the odorant-binding protein receptors project from these neu¬ 
rons into the mucous lining of the nasal cavity. 

Biochemical studies in the late 1980s examined isolated cilia from rat ol¬ 
factory epithelium that had been treated with odorants. Exposure to the 
odorants increased the cellular levei of cAMP, and this increase was ob- 
served only in the presence of GTP. On the basis of what was known about 
signal-transduction systems, the participation of cAMP and GTP strongly 
suggested the involvement of a G protein and, hence, 7TM receptors. Indeed, 
Randall Reed purified and cloned a G protein a subunit, termed G(oif), 
which is uniquely expressed in olfactory cilia. The involvement of 7TM re¬ 
ceptors suggested a strategy for identifying the olfactory receptors them- 
selves. cDNAs were sought that (1) were expressed primarily in the sensory 
neurons lining the nasal epithelium, (2) encoded members of the 7TM re¬ 
ceptor family, and (3) were present as a large and diverse family to account 
for the range of odorants. Through the use of these criteria, cDNAs for odor¬ 
ant receptors from rats were identified in 1991 by Richard Axel and Linda 
Buck. 
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FIGURE 32.2 The main nasal 
epithelium. This region of the nose, 
which lies at the top of the nasal cavity, 
contains approximately 1 million sensory 
neurons. Nerve impulses generated by 
odorant molecules binding to receptors on 
the cilia travei from the sensory neurons 
to the olfactory bulb. 
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FIGURE 32.3 Evolutíon of odorant 
receptors. Odorant receptors appear to 
have lost function through conversion into 
pseudogenes in the course of primate 
evolution. The percentage of OR genes 
that appear to be functional for each 
species is shown in parentheses. 
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The odorant receptor (hereafter, OR) family is 
even larger than expected: more than 1000 OR 
genes are present in the mouse and the rat, whereas the hu- 
man genome encodes between an estimated 500 and 750 
ORs. The OR family is thus one of the largest gene fam- 
ilies in human beings. However, more than half the hu- 
man odorant receptor genes appear to be pseudogenes 
(Section Sl.x.x); that is, they contain mutations that pre- 
vent the generation of a full-length, proper odorant re¬ 
ceptor. In contrast, essentially all rodent OR genes are 
fully functional. Further analysis of primate OR genes 
reveals that the fraction of pseudogenes is greater in 
species more closely related to human beings (Figure 
32.3). Thus, we may have a glimpse at the evolutionary 
loss of acuity in the sense of smell as higher mammals 
presumably became less dependent on this sense for sur- 
vival. 

The OR proteins are typically 20% identical in sequence to the j8-adren- 
ergic receptor (Section 1 S.x.x) and from 30 to 60% identical with each other. 
Several specific sequence features are present in most or all OR family mem- 
bers (Figure 32.4). The central region, particularly transmembrane helices 
4 and 5, is highly variable, suggesting that this region is the site of odorant 
binding. That site must be different in odorant receptors that bind distinct 
odorant molecules. 


FIGURE 32.4 Conserved and varíant 
regíons in odorant receptors. Odorant 
receptors are members of the 7TI\/1 
receptor family. The green cylinders 
represent the seven presumed transmem¬ 
brane helices. Strongly conserved residues 
characteristic of this protein family are 
shown in blue, whereas highly variable 
residues are shown in red. 



What is the relation between OR gene expression and the individual 
neuron? Interestingly, each olfactory neuron expresses only a single OR gene, 
among hundreds available. Apparently, the precise OR gene expressed is 
determined largely at random. The mechanism by which all other OR genes 
are excluded from expression remains to be elucidated. The binding of an 
odorant to an OR on the neuronal surface initiates a signal-transduction 
Cascade that results in an action potential (Figure 32.5). The ligand-bound 
OR activates G(oif), the specific G protein mentioned earlier. G(oif) is ini- 
tially in its GDP-bound form. When activated, it releases GDP, binds GTP, 
and releases its associated jSy subunits. The a subunit then activates a spe¬ 
cific adenylate cyclase, increasing the intracellular concentration of cAMP. 
The rise in the intracellular concentration of cAMP activates a nonspecific 
cation channel that allows calcium and other cations into the cell. The flow 
of cations through the channel depolarizes the neuronal membrane and ini¬ 
tiates an action potential. This action potential, combined with those from 
other olfactory neurons, leads to the perception of a specific odor. 
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CONCEPTUAL INSIGHTS, Signalíng Pathways: Response and Recovery 

presents an animated version of Figure 32.5 and a comparison to visual signa! transduction 
(Figure 32.25). 

32.1.2 Odorants Are Decoded by a Combinatorial Mechanism 

An obvious challenge presented to the investigator by the large size of the 
OR family is to match up each OR with the one or more odorant molecules 
to which it binds. Exciting progress has been made in this regard. Initially, 
an OR was matched with odorants by overexpressing a single, specific OR 
gene in rats. This OR responded to straight-chain aldehydes, most favor- 
ably to n-octanal and less strongly to n-heptanal and n-hexanal. More dra- 
matic progress was made by taking advantage of our knowledge of the OR 
signal-transduction pathway and the power of PCR (Section 6.1.5). A sec- 
tion of nasal epithelium from a mouse was loaded with the calcium- 
sensitive dye Fura-2 (Section IS.x.x). The tissue was then treated with dif- 
ferent odorants, one at a time, at a specific concentration. If the odorant bound 
to and activated an OR, that neuron could be detected under a microscope 
by the change in fluorescence caused by the influx of calcium that occurs as 
part of the signal-transduction process. To determine which OR was respon- 
sible for the response, cDNA was generated from mRNA that had been iso- 
lated from single identified neurons. The cDNA was then subjected to PCR 
with the use of primers that are effective in amplifying 
most or all OR genes. The sequence of the PCR product 
from each neuron was then determined and analyzed. 

Using this approach, investigators analyzed the re¬ 
sponses of neurons to a series of compounds having vary- 
ing chain lengths and terminal functional groups (Figure 
32.6). The results of these experiments appear surpris- 
ing at first glance (Figure 32.7). Importantly, there is not 
a simple 1:1 correspondence between odorants and 
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FIGURE 32.6 Four series of odorants tested for olfactory 
receptor actívation. 
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FIGURE 32.5 The olfactory signal- 
transductíon cascade. The binding of 
odorant to the olfactory receptor activates 
a signalíng pathway similar to those initi- 
ated in response to the binding of some 
hormones to their receptors (see Section 
15.X.X.). The final result is the opening of 
cAMP-gated ion channels and the initia- 
tion of an action potential. 


FIGURE 32.7 Patterns of olfactory 
receptor actívation. Fourteen different 
receptors were tested for responsiveness 
to the compounds shown in Figure 32.6. 
A colored box indicates that the receptor 
at the top responded to the compound 
at the left. Darker colors indicate that 
the receptor was activated at a lower 
concentration of odorant. 
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FIGURE 32.8 Convergíng olfactory 
neurons. This section of the nasal cavity 
is stained to reveal processes from sensory 
neurons expressing the same olfactory 
receptor. The processes converge to a 
single location in the olfactory bulb. 

[From P. Mombaerts, F. Wang, C. Dulac, S. K. Chao, 
A. Nemes, M. Mendelsohn, J. Edmondson, and 
R. Axel. Cell 87(1996):675-689.] 



FIGURE 32.9 The Cyranose 320. 

The electronic nose may find uses in 
the food industry, animal husbandry, law 
enforcement, and medicine. [Courtesy of 
Cyrano Sciences.] 


receptors. Almost every odorant activates a number of receptors (usually to 
different extents) and almost every receptor is activated by more than one 
odorant. Note, however, that each odorant activates a unique combination 
of receptors. In principie, this combinatorial mechanism allows 
even a relatively small array of receptors to distinguish a vast number of 
odorants. 

How is the information about which receptors have been activated 
transmitted to the brain? Recall that each neuron expresses only one OR 
and that the pattern of expression appears to be largely random. A sub- 
stantial clue to the connections between receptors and the brain has been 
provided by the creation of mice that express a gene for an easily detectable 
colored marker in conjunction with a specific OR gene. Olfactory neurons 
that express the OR-marker protein combination were traced to their des- 
tination in the brain, a structure called the olfactory bulb (Figure 32.8). The 
processes from neurons that express the same OR gene were found to con- 
nect to the same location in the olfactory bulb. Moreover, this pattern of 
neuronal connection was found to be identical in all mice examined. Thus, 
neurons that express specific ORs are linked with specific sites in the brain. 
This property creates a spatial map of odorant-responsive neuronal activ- 
ity within the olfactory bulb. 

Can such a combinatorial mechanism truly distinguish many different 
odorants? An ‘‘electronic nose’' that functions by the same principies pro- 
vides compelling evidence that it can (Figure 32.9). The receptors for the 
electronic nose are polymers that bind a range of small molecules. Each 
polymer binds every odorant, but to varying degrees. Importantly, the elec- 
trical properties of these polymers change on odorant binding. A set of 32 
of these polymer sensors, wired together so that the pattern of responses can 
be evaluated, is capable of distinguishing individual compounds such as 
n-pentane and n-hexane as well as complex mixtures such as the odors of 
fresh and spoiled fruit. 

32.1.3 Functional Magnetic Resonance Imaging Reveals Regions 
of the Brain Processing Sensory Information 

Can we extend our understanding of how odorants are perceived to events 
in the brain? Biochemistry has provided the basis for powerful methods for 
examining responses within the brain. One method, called/unctíona/ mag¬ 
netic resonance imaging (fMRI), takes advantage of two key observations. 
The first is that, when a specific part of the brain is active, blood vessels re- 
lax to allow more blood flow to the active region. Thus, a more active re- 
gion of the brain will be richer in oxyhemoglobin. The second observation 
is that the iron center in hemoglobin undergoes substantial structural 
changes on binding oxygen (Section 10.4.1). These changes are associated 
with a rearrangement of electrons such that the iron in deoxyhemoglobin 
acts as a strong magnet, whereas the iron in oxyhemoglobin does not. The 
difference between the magnetic properties of these two forms of hemoglo¬ 
bin can be used to image brain activity. 

Nuclear magnetic resonance techniques (Section 4.5.1) detect signals 
that originate primarily from the protons in water molecules but are altered 
by the magnetic properties of hemoglobin. With the use of appropriate tech¬ 
niques, images can be generated that reveal differences in the relative 
amounts of deoxy- and oxyhemoglobin and thus the relative activity of var- 
ious parts of the brain. 

These noninvasive methods reveal areas of the brain that process sen¬ 
sory information. For example, subjects have been imaged while breathing 
air that either does or does not contain odorants. When odorants are pres- 






ent, the fMRI technique detects an increase in the levei of hemoglobin oxy- 
genation (and, hence, brain activity) in several regions of the brain (Figure 
32.10). Such regions include those in the primary olfactory córtex as well as 
other regions in which secondary processing of olfactory signals presum- 
ably takes place. Further analysis reveals the time course of activation of 
particular regions and other features. Functional MRI shows tremendous 
potential for mapping regions and pathways engaged in processing sensory 
information obtained from all the senses. Thus, a seemingly incidental as- 
pect of the biochemistry of hemoglobin has yielded the basis for observing the 
brain in action. 


32.2 TASTE IS A COMBINATION OF SENSES THAT 
FUNCTION BY DIFFERENT MECHANISIVIS 

The inability to taste food is a common complaint when nasal congestion 
reduces the sense of smell. Thus, smell greatly augments our sense of taste 
(also known as gustation), and taste is, in many ways, the sister sense to ol- 
faction. Nevertheless, the two senses differ from each other in several im- 
portant ways. First, we are able to sense several classes of compounds by 
taste that we are unable to detect by smell; salt and sugar have very little 
odor, yet they are primary stimuli of the gustatory system. Second, whereas 
we are able to discriminate thousands of odorants, discrimination by taste 
is much more modest. Five primary tastes are perceived: bitter, sweet, sour, 
salty, and umami (the taste of glutamate from the Japanese word for ‘‘deli- 
ciousness’'). These five tastes serve to classify compounds into potentially 
nutritive and beneficiai (sweet, salty, umami) or potentially harmful or toxic 
(bitter, sour). Tastants (the molecules sensed by taste) are quite distinct for 
the different groups (Figure 32.11). 



FIGURE 32.10 Brain response to 
odorants. A functional magnetic 
resonance image reveals brain response 
to odorants. The light spots indicate 
regions of the brain activated by odorants. 
[N. Sobel et al., 7. Neurophysiol. 83:537-551 
2000 537; courtesy of Nathan Sobel.] 



FIGURE 32.11 Examples of tastant molecules. Tastants fali into five groups: sweet, 
salty, umami, bitter, and sour. 


The simplest tastant, the hydrogen ion, is perceived as sour. Other sim- 
ple ions, particularly sodium ion, are perceived as salty. The taste called 
umami is evoked by the amino acid glutamate, often encountered as the fla- 
vor enhancer monosodium glutamate (MSG). In contrast, tastants perceived 
as bitter or sweet are extremely diverse. Many bitter compounds are alkaloids 
or other plant products of which many are toxic. However, they do not have 
any common structural elements or other common properties. Carbohy- 
drates such as glucose and sucrose are perceived as sweet, as are other com¬ 
pounds including some simple peptide derivatives, such as aspartame, and 
even some proteins. 






32-8 


CHAPTER 32 • Sensory Systems 




Bitter 


• • 

Sour 


Sweet 


Salty 


FIGURE 32.12 Taste-sensítíve areas 
of the tongue. The tongue contains 
overlapping regions that are particularly 
sensitive to specific taste stimuli. 
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Aspartame 

These differences in specificity among the five tastes are due to differ- 
ences in their underlying biochemical mechanisms. The sense of taste is, in 
fact, a number of independent senses all utilizing the same organ, the tongue, 
for their expression. 

Tastants are detected by specialized structures called taste buds, which 
contain approximately 150 cells, including sensory neurons. Projections 
called taste papillae contain numerous taste buds. Papillae somewhat spe¬ 
cialized to detect different taste types are concentrated in distinct regions 
across the surface of the tongue. Consequently, different regions of the 
tongue are particularly sensitive to specific classes of tastants (Figure 32.12). 

32.2.1 Sequencing the Human Genome Led to the Discovery 
of a Large Family of 7TIVI Bitter Receptors 


o 



6-n-Propyl-2-thiouracil 

(PROP) 


Just as in olfaction, a number of clues pointed to the involvement of G pro- 
teins and, hence, 7TM receptors in the detection of bitter and sweet tastes. 
The evidence included the isolation of a specific G protein a subunit termed 
gustducin, which is expressed primarily in taste buds (Figure 32.13). How 
could the 7TM receptors be identified? The ability to detect some com- 
pounds depends on specific genetic loci in both human beings and mice. 
For instance, the ability to taste the bitter compound 6-n-propyl-2- 
thiouracil (PROP) was mapped to a region on human chromosome 5 by 
comparing DNA markers of persons who vary in sensitivity to this 
compound. 

This observation suggested that this region might encode a 7TM re¬ 
ceptor that responded to PROP. Approximately 450 kilobases in this region 
had been sequenced early in the human genome project. This sequence was 
searched by Computer for potential 7 TM receptor genes, and, indeed, one 
was detected and named T2R-1. Additional database searches for sequences 
similar to this one detected 12 genes encoding full-length receptors as well 
as 7 pseudogenes within the sequence of the human genome known at 


FIGURE 32.13 Expression of gustducín 
in the tongue. (A) A section of tongue 
stained with a fluorescent antibody reveals 
the position of the taste buds. (B) The 
same region stained with a antibody 
directed against gustducin reveals that 
this G protein is expressed in taste buds. 
[Courtesy of Charles S. Zuker.] 
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FIGURE 32.14 Conserved and variant regions in bitter receptors. The bitter receptors 
are members of the TTIVl receptor family. Strongly conserved residues characteristic of this 
protein family are shown in blue, and highly variable residues are shown in red. 


the time. The encoded proteins were between 30 and 70% identical with 
T2R-1 (Figure 32.14). Further analysis suggests that there arefrom 50 to 100 
members of this family of 7TM receptors in the entire human genome. Similar 
sequences have been detected in the mouse and rat genomes. 

Are these proteins, in fact, bitter receptors? Several lines of evidence sug- 
gest that they are. First, their genes are expressed in taste-sensitive cells— 
in fact, in many of the same cells that express gustducin. Second, cells 
that express individual members of this family respond to specific bitter 
compounds. For example, cells that express a specific mouse receptor 
(mT2R-5) responded when exposed specifically to cycloheximide. Third, 
mice that had been found unresponsive to cycloheximide were found to have 
point mutations in the gene encoding mT2R-5. Finally, cycloheximide 
specifically stimulates the binding of GTP analogs to gustducin in the pres- 
ence of the mT2R-5 protein (Figure 32.15). 

Importantly, each taste receptor cell expresses many different members 
of the T2R family. This pattern of expression stands in sharp contrast to the 
pattern of one receptor type per cell that characterizes the olfactory system 
(Figure 32.16). The difference in expression patterns accounts for the much 
greater specificity of our perceptions of smells compared with tastes. We are 
ahle to distinguish among subtly different odors because each odorant stimulates 


Taste 



FIGURE 32.15 Evidence that T2R 
proteins are bitter taste receptors. 

Cycloheximide uniquely stimulates the 
binding of the GTP analog GTPyS to 
gustducin in the presence of the mT2R 
protein. [Adapted from J. Chandrashekar, K. L. 
Mueller, M. A. Hoon, E. Adier, L. Feng, W. Guo, 

C. S. Zuker, and N. J. Ryba. Cell 100(2000):703.] 


OLFACTION 



TASTE (bitter) 



FIGURE 32.16 Differing gene 
expression and connection patterns 
in olfactory and bitter taste receptors. 

In olfaction, each neuron expresses a single 
OR gene, and the neurons expressing the 
same OR converge to specific sites in 
the brain, enabling specific perception 
of different odorants. In gustation, each 
neuron expresses many bitter receptor 
genes, so the identity of the tastant is lost 
in transmission. 
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FIGURE 32.17 Schematk structure 
of the amiloríde-sensitíve sodium 
channel. Only one of the four subunits 
that constitute the functional channel is 
illustrated. The amiloride-sensitive sodium 
channel belongs to a superfamily having 
common structural features, including two 
hydrophobic membrane-spanning regions, 
intracellular amino and carboxyl termini; 
and a large, extracellular region with con- 
served cysteine-rich domains. 


a unique pattern of neurons. In contrast, many tastants stimulate the same neu- 
rons. Thus, we perceive only ‘‘bitter’' without the ability to discriminate cy- 
cloheximide from quinine. 

32.2.2 A Family of 7TIVI Receptors Almost Certainly 
Respond to Sweet Compounds 

Most sweet compounds are carbohydrates, energy rich and easily digestible. 
However, as noted in Chapter 11, carbohydrates are structurally diverse. 
Moreover, some noncarbohydrate compounds such as aspartame also taste 
sweet. The structural diversity among sweet-tasting compounds, though 
less than that among bitter compounds, strongly suggests that a family of 
receptors detects these compounds. The sweet receptors are very likely cou- 
pled to the G protein gustducin and, hence, are members of the 7TM super¬ 
family. This likelihood was most sharply revealed in studies that disrupted 
the gene for gustducin in mice. These mice lost much of their ability to sense 
both bitter and sweet compounds. Genes for candidate receptors have been 
cloned but have not yet been proved to be functional sweet receptors. 

32.2.3 Salty Tastes Are Detected Primarily by the Passage 
of Sodium lons Through Channels 

Salty tastants are not detected by 7 TM receptors. Rather, they are detected 
directly by their passage through ion channels expressed on the surface of 
cells in the tongue. Evidence for the role of these ion channels comes from 
examining known properties of sodium channels characterized in other bi- 
ological contexts. One class of channels, characterized first for their role in 
salt reabsorption, are thought to be important in salt taste detection because 
they are sensitive to the compound amiloride, which mutes the taste of salt 
and significantly lowers sensory neuron activation in response to sodium. 

An amiloride-sensitive sodium channel comprises four subunits that may 
be either identical or distinct but in any case are homologous. An individ¬ 
ual subunit ranges in length from 500 to 1000 amino acids and includes two 
presumed membrane-spanning helices as well as a large extracellular do- 
main in between them (Figure 32.17). The extracellular region includes two 
(or, sometimes, three) distinct regions rich in cysteine residues (and, pre- 
sumably, disulfide bonds). A region just ahead of the second membrane- 
spanning helix appears to form part of the pore in a manner analogous to 
the structurally characterized potassium channel (Section 13.5.6). The 
members of the amiloride-sensitive sodium-channel family are numerous 
and diverse in their biological roles. We shall encounter them again in the 
context of the sense of touch. 

Sodium ions passing through these channels produce a significant trans- 
membrane current. Amiloride blocks this current, accounting for its effect 
on taste. However, about 20% of the response to sodium remains even in 
the presence of amiloride, suggesting that other ion channels also contribute 
to salt detection. 

32.2.4 Sour Tastes Arise from the Effects 
of Hydrogen lons (Acids) on Channels 

Like salty tastes, sour tastes are also detected by direct interactions with ion 
channels, but the incoming ions are hydrogen ions (in high concentrations) 
rather than sodium ions. For example, in the absence of high concentrations 
of sodium, hydrogen ion flow can induce substantial transmembrane cur- 
rents through amiloride-sensitive sodium channels. However, hydrogen 
ions are also sensed by mechanisms other than their direct passage through 









membranes. Binding by hydrogen ions blocks some potassium channels and 
activates other types of channels. Together, these mechanisms lead to 
changes in membrane polarization in sensory neurons that produce the sen- 
sation of sour taste. 

32.2.5 Umami, the Taste of Glutamate, Is Detected by a 
Specialized Form of Glutamate Receptor 

Glutamate is an abundant amino acid that is present in protein-rich foods 
as well as in the widely used f lavor enhancer monosodium glutamate. This 
amino acid has a taste, termed umami, that is distinct from the other four 
basic tastes. Adults can detect glutamate at a concentration of approximately 
1 mM. Glutamate is also a widely used neurotransmitter, and thus, not sur- 
prisingly, several classes of receptors for glutamate have been identified in 
the nervous system. One class, called metabotrophic glutamate receptors, are 
7TM receptors with large amino-terminal domains of approximately 600 
amino acids. Sequence analysis reveals that the first half of the amino- 
terminal region is most likely a ligand-binding domain, because it is ho- 
mologous to such domains found in the Lac repressor (Section Sl.x.x) and 
other bacterial ligand-binding proteins. 

Y One glutamate receptor gene, encoding a protein called the 
metabotrophic glutamate receptor 4 (mGluR4), has been found to 
be expressed in taste buds. Further analysis of the mRNA that is expressed 
in taste buds reveals that this mRNA lacks the region encoding the first 309 
amino acids in brain mGluR4, which includes most of the high-affinity glu¬ 
tamate-binding domain (Figure 32.18). The glutamate receptor found in 
taste buds shows a lowered affinity for glutamate that is appropriate to glu¬ 
tamate leveis in the diet. Thus, the receptor responsible for the perception of 
glutamate taste appears to have evolved simply by changes in the expression of 
an existing glutamate-receptor gene. We shall consider an additional recep¬ 
tor related to taste, that responsible for the ‘Tot'' taste of spicy food, when 
we deal with mechanisms of touch perception. 
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FIGURE 32.18 Schematíc structure of 
a metabotrophic glutamate receptor. 

The umami receptor is a variant of a brain 
glutamate receptor. A substantial part of 
the high-affinity glutamate-binding domain 
(shown in yellow) is missing in the form 
expressed in the tongue. 


32.3 PHOTORECEPTOR MOLECULES 
IN THE EYE DETECT VISIBLE LIGHT 

Vision is based on the absorption of light by photoreceptor cells in the eye. 
These cells are sensitive to light in a relatively narrow region of the electro- 
magnetic spectrum, the region with wavelengths between 300 and 850 nm 
(Figure 32.19). Vertebrates have two kinds of photoreceptor cells, called rods 
and cones because of their distinctive shapes. Cones function in bright light 
and are responsible for color vision, whereas rods function in dim light but 
do not perceive color. A human retina contains about 3 million cones and 
100 million rods. Remarkably, a rod cell can respond to a single photon, and 
the brain requires fewer than 10 such responses to register the sensation of 
a flash of light. 
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FIGURE 32.19 The electromagnetíc 
spectrum. Visible light has wavelengths 
between 300 and 850 nanometers. 
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32.3.1 Rhodopsin, a Specialized 7TIVI Receptor, 

Absorbs Visible Light 

Rods are slender elongated structures; the outer segment is specialized for 
photoreception (Figure 32.20). It contains a stack of about 1000 discs, which 
are membrane-enclosed sacs densely packed with photoreceptor molecules. 
The photosensitive molecule is often called a visual pigment because it is 
highly colored owing to its ability to absorb light. The photoreceptor mol¬ 
ecule in rods is rhodopsin (Section 1 S.x.x), which consists of the protein opsin 
linked to 11-cis-retinal, a prosthetic group. 



Disks 


Outer segment 


FIGURE 32.20 The rod cell. (Left) Scanning electron micrograph of retinal rod cells. 
(Right) Schematic representation of a rod cell. [Photograph courtesy of Dr. Deric Bownds.] 



FIGURE 32.21 Rhodopsin absorptíon 
spectrum. 


Rhodopsin absorbs light very efficiently in the middle of the visible spec¬ 
trum, its absorption being centered on 500 nm, which nicely matches the 
solar output (Figure 32.21). A rhodopsin molecule will absorb a high per- 
centage of the photons of the correct wavelength that strike it, as indicated 
by the extinction coefficient of 40,000 M~^cm~^ at 500 nm. The extinction 
coefficient for rhodopsin is more than an order of magnitude greater than 
that for tryptophan, the most efficient absorber in proteins that lack pros¬ 
thetic groups. 

Opsin, the protein component of rhodopsin, is a member of the 7TM 
receptor family. Indeed, rhodopsin was the first member of this family to 
be purified, its gene was the first to be cloned and sequenced, and its three- 
dimensional structure was the first to be determined. The color of rhodopsin 
and its responsiveness to light depend on the presence of the light-absorb- 
ing group (chromophore) 11-ds-retinal. This compound is a powerful ab¬ 
sorber of light because it is a polyene; its six alternating single and double 
bonds constitute a long, unsaturated electron network. Recall that alternat¬ 
ing single and double bonds account for the chromophoric properties of 
chlorophyll (Section 20.x.x). The aldehyde group of 11-ds-retinal forms a 
Schiff base (Figure 32.22) with the e-amino group of lysine residue 296, 
which lies in the center of the seventh transmembrane helix. Free retinal ab¬ 
sorbs maximally at 370 nm, and its unprotonated Schiff-base adduct ab¬ 
sorbs at 380 nm, whereas the protonated Schiff base absorbs at 440 nm or 
longer wavelengths. Thus, the 500-nm absorption maximum for rhodopsin 
strongly suggests that the Schiff base is protonated; additional interactions 



















Schiff base 
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FIGURE 32.22 Retinal-lysíne línkage. Retinal is linked to lysine 296 in opsin by a 
Schiff-base linkage. In the resting State of rhodopsin, this Schiff base is protonated. 


with opsin shift the absorption maximum farther toward the red. The pos¬ 
itive charge of the protonated Schiff base is compensated by the negative 
charge of glutamate 113 located in helix 2; the glutamate residue closely ap- 
proaches the lysine-retinal linkage in the three-dimensional structure of 
rhodopsin. 


STRUCTURAL INSIGHTS, Rhodopsin: A G Protein Coupled 7TM Receptor 

offers a more detailed look at rhodopsin structure and function (Figure 32.5). 


32.3.2 Light Absorption Induces a Specific Isomerization 
of Bound ll-c/s-Retinal 

How does the absorption of light by the retinal Schiff base generate a sig- 
nal? George Wald and his coworkers discovered that light absorption results 
in the isomerization of the 11-cis-retinal group of rhodopsin to its a//-trans 
form (Figure 32.23). This isomerization causes the Schiff-base nitrogen atom 
to move approximately 5 A, assuming that the cyclohexane ring of the reti¬ 
nal group remains fixed. In essence, the light energy of a photon is converted 
into atomic motion. The change in atomic positions, like the binding of a 
ligand to other 7TM receptors, sets in train a series of events that lead to 
the closing of ion channels and the generation of a nerve impulse. 

The isomerization of the retinal Schiff base takes place within a few 
picoseconds of a photon being absorbed. The initial product, termed 
bathorhodopsin, contains a strained all-trans-retinal group. Within approx¬ 
imately 1 millisecond, this intermediate is converted through several addi- 
tional intermediates into metarhodopsin 11. In metarhodopsin II, the Schiff 
base is deprotonated and the opsin protein has undergone significant 
reorganization. 




FIGURE 32.23 Atomic motion in retinal. The Schiff-base nitrogen atom moves 5 Â as 
a consequence of the light-induced isomerization of 11-c/s-retinal to all-frons-retinal by 
rotation about the bond shown in red. 
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FIGURE 32.24 Analogous 7TIVI 
receptors. The conversion of rhodopsin 
into metarhodopsin II activates a signal- 
transduction pathway analogousiy to the 
activation induced by the binding of other 
TTIVI receptors to appropriate ligands. 


Metarhodopsin II (also referred to as R^) is analogous to the ligand- 
bound State of 7TM receptors such as the j 82 -adrenergic receptor (Section 
IS.x.x) and the odorant and tastant receptors heretofore discussed (Figure 
32.24). Like these receptors, this form of rhodopsin activates a het- 
erotrimeric G protein that propagates the signal. The G protein associated 
with rhodopsin is called transducin. Metarhodopsin II triggers the exchange 
of GDP for GTP by the a subunit of transducin (Figure 32.25). On the 
binding of GTP, the jSy subunits of transducin are released and the a sub¬ 
unit switches on a cGMP phosphodiesterase by binding to and removing an 
inhibitory subunit. The activated phosphodiesterase is a potent enzyme that 
rapidly hydrolyzes cGMP to GMP. The reduction in cGMP concentration 
causes cGMP-gated ion channels to dose, leading to hyperpolarization of 
the membrane and neuronal signaling. At each step in this process, the ini- 
tial signal—the absorption of a single photon—is amplified so that it leads to 
sufficient membrane hyperpolarization to result in signaling. 



FIGURE 32.25 Visual signal transduction. The light-índuced activation of rhodopsin 
leads to the hydrolysis of cGMP, which in turn leads to ion channel closing and the 
initiation of an action potential. 
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CONCEPTUAL INSIGHTS, Signaling Pathways: Response and Recovery 

presents an animated version of Figure 32.25 and a comparison to olfactory 
signal transduction (Figure 32.5). 


32.3.3 Light-Induced Lowering of the 
Calcium Levei Coordinates Recovery 

As we have seen, the visual system responds to changes in light and color 
within a few milliseconds, quickly enough that we are able to perceive con- 
tinuous motion at nearly 1000 frames per second. To achieve a rapid re¬ 
sponse, the signal must also be terminated rapidly and the system must be 
returned to its initial State. First, activated rhodopsin must be blocked from 
continuing to activate transducin. Rhodopsin kinase catalyzes the phospho- 
rylation of the carboxyl terminus of at multiple serine and threonine 
residues. Arrestin, an inhibitory protein (Section IS.x.x), then binds phos- 
phorylated R^ and prevents additional interaction with transducin. 

Second, the a subunit of transducin must be returned to its inactive State 
to prevent further signaling. Like other G proteins, the a subunit possesses 
built-in GTPase activity that hydrolyzes bound GTP to GDP. Hydrolysis 
takes place in less than a second when transducin is bound to the phospho¬ 
diesterase. The GDP form of transducin then leaves the phosphodiesterase 
and reassociates with the jSy subunits, and the phosphodiesterase returns to 
its inactive state. Third, the levei of cGMP must be raised to reopen the 
cGMP-gated ion channels. The action of guanylate cyclase accomplishes this 
third step by synthesizing cGMP from GTP. 



















Calcium ion plays an essential role in controlling guanylate cyclase be- ^ 32 - 15 1 

cause it markedly inhibits the activity of the enzyme. In the dark, Ca^^ as Vision 

well as Na^ enter the rod outer segment through the cGMP-gated chan- 
nels. Calcium ion influx is balanced by its efflux through an exchanger, a 
transport system that uses the thermodynamically favorable flow of four 
Na^ ions into the cell and one ion out of the cell to extrude one Ca^^ 
ion. After illumination, the entry of Ca^^ through the cGMP-gated chan- 
nels stops, but its export through the exchanger continues. Thus, the cy- 
tosolic Ca^^ levei drops from 500 nM to 50 nM after illumination. This 
drop markedly stimulates guanylate cyclase, rapidly restoring the concen- 
tration of cGMP to reopen the cGMP-gated channels. 


Activation 

Ion 

[cGI\/lP]f -> channels 

closed 


Recovery 

rc 2+11 _^ Guanylate cyclase 
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By controlling the rate of cGMP synthesis, leveis govern the speed with 
which the system is restored to its initial State. 


32.3.4 Color Vision Is Mediated by Three Cone Receptors 
That Are Homologs of Rhodopsin 

Cone cells, like rod cells, contain visual pigments. Like rhodopsin, these 
photoreceptor proteins are members of the 7 TM receptor family and uti¬ 
lize 11 -ds-retinal as their chromophore. In human cone cells, there are three 
distinct photoreceptor proteins with absorption maxima at 426, 530, and 
— 560 nm (Figure 32.26). These absorbances correspond to (in fact, define) 
the blue, green, and red regions of the spectrum. Recall that the absorption 
maximum for rhodopsin is 500 nm. 

The amino acid sequences of the cone photoreceptors have been com- 
pared with each other and with rhodopsin. The result is striking. Each of 
the cone photoreceptors is approximately 40% identical in sequence with 
rhodopsin. Similarly, the blue photoreceptor is 40% identical with each of 
the green and red photoreceptors. The green and red photoreceptors, how- 
ever, are > 95% identical with each other, differing in only 15 of 364 posi- 
tions (Figure 32.27). 



Wavelength (nm) 

FIGURE 32.26 Cone-pigment absorp¬ 
tion spectra. The absorption spectra of 
the cone visual pigment responsible for 
color Vision. 



FIGURE 32.27 Comparíson of the amino acid sequences of the green and red pho¬ 
toreceptors. Open circies correspond to identical residues, whereas colored circies mark 
residues that are different. The differences in the three black positions are responsible for 
most of the difference in their absorption spectra. 
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FIGURE 32.28 Evolutíonary relatíon- 
shíps among visual pígments. Visual 
pigments have evolved by gene duplication 
along different branches of the animal 
evolutionary tree. The branch lengths of 
the "trees" correspond to the percentage 
of amino acid divergence. [Adapted from 
Nathans, J. Neuron 24(1999):299; by permission 
of Cell Press.] 
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Y These observations are sources of insight into photoreceptor evolu- 
tion. First, the green and red photoreceptors are clearly products of 
a recent evolutionary event (Figure 32.28). The green and red pigments ap- 
pear to have diverged in the primate lineage approximately 3 5 million years 
ago. Mammals, such as dogs and mice, that diverged from primates earlier 
have only two cone photoreceptors, blue and green. They are not sensitive 
to light as far toward the infrared region as we are, and they do not dis- 
criminate colors as well. In contrast, birds such as chickens have a total of 
six pigments: rhodopsin, four cone pigments, and a pineal visual pigment 
called pinopsin. Birds have highly acute color perception. 

Second, the high levei of similarity between the green and red pigments 
has made it possible to identify the specific amino acid residues that are re- 
sponsible for spectral tuning. Three residues (at positions 180, 277, and 285) 
are responsible for most of the difference between the green and red pig¬ 
ments. In the green pigment, these residues are alanine, phenylalanine, and 
alanine, respectively; in the red pigment, they are serine, tyrosine, and thre- 
onine. A hydroxyl group has been added to each amino acid in the red pig¬ 
ment. The hydroxyl groups can interact with the photoexcited State of reti- 
nal and lower its energy, leading to a shift toward the lower-energy (red) 
region of the spectrum. 


\ explor 


STRUCTURAL INSIGHTS, Rhodopsin: A G Proteín Coupled 7TM Receptor 

explores the structural basis of color vision and night blindness in more detail. 


I- 

Homologous recombination— 

The exchange of DNA segments at 
equivalent positions between chromo- 
somes with substantial sequence 
similarity. 


32.3.5 Rearrangements in the Genes for the Green and 
Red Pigments Lead to "Color Blindness" 

7 The genes for the green and red pigments lie adjacent to each other 
on the human X chromosome. These genes are more than 98% iden- 
tical in nucleotide sequence, including introns and untranslated regions as 
well as the protein-coding region. Regions with such high similarity are very 
susceptible to unequal homologous recombination. 

Recombination can take place either between or within transcribed re¬ 
gions of the gene (Figure 32.29). If recombination takes place between tran¬ 
scribed regions, the product chromosomes will differ in the number of pig¬ 
ment genes that they carry. One chromosome will lose a gene and thus may 
lack the gene for, say, the green pigment; the other chromosome will gain a 
gene. Consistent with this scenario, approximately 2% of human X chro¬ 
mosomes carry only a single color pigment gene, approximately 20% carry 
two, 50% carry three, 20% carry four, and 5% carry five or more. A person 
lacking the gene for the green pigment will have trouble distinguishing red 
and green color, characteristic of the most common form of color blindness. 




















(A) Recombination between genes 
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Hearing 


FIGURE 32.29 Recombination pathways leading to color blindness. Rearrangements 
in the course of DNA replication may lead to (A) the loss of visual pigment genes or (B) the 
formation of hybrid pigment genes that encode photoreceptors with anomolous absorption 
spectra. Because the amino acids most important for determining absorption spectra are in 
the carboxyl-terminal half of each photoreceptor protein, the part of the gene that encodes 
this region most strongly affects the absorption characteristics of hybrid receptors. [Adapted 
from J. Nathans. Neuron 24(1999);299-312; by permission of Cell Press.] 


Approximately 5% of males have this form of color blindness. Recombina¬ 
tion can also take place within the transcription units, resulting in genes that 
encode hybrids of the green and red photoreceptors. The absorption maxi- 
mum of such a hybrid lies between that of the red and green pigments. A 
person with such hybrid genes who also lacks either a functional red or a 
functional green pigment gene does not discriminate color well. 


32.4 HEARING DEPENDS ON THE SPEEDY DETECTION 
OF MECHANICAL STIMULI 


Hearing and touch are based on the detection of mechanical stimuli. Although 
the proteins of these senses have not been as well characterized as those of 
the senses already discussed, anatomical, physiological, and biophysical stud- 
ies have elucidated the fundamental processes. A major clue to the mechanism 
of hearing is its speed. We hear frequencies ranging from 200 to 20,000 Hz 
(cycles per second), corresponding to times of 5 to 0.05 ms. Furthermore, our 
ability to locate sound sources, one of the most important functions of hear¬ 
ing, depends on the ability to detect the time delay between the arrival of a 
sound at one ear and its arrival at the other. Given the separation of our ears 
and the speed of sound, we must be able to accurately sense time differences 
of 0.7 ms. In fact, human beings can locate sound sources associated with 
temporal delays as short as 0.02 ms. This high time resolution implies that 
hearing must employ direct transduction mechanisms that 
do not depend on second messengers. Recall that, in 
vision, for which speed also is important, the signal-trans- 
duction processes take place in milliseconds. 


FIGURE 32.30 Haír cells, the sensory 
neurons crucial for hearing. [Adapted 
from Hudspeth, A. J. Nature 341 (1989):397.] 


32.4.1 Hair Cells Use a Connected Bundie 
of Stereocilia to Detect Tiny Motions 

Sound waves are detected inside the cochlea of the inner 
ear. The cochlea is a fluid-filled, membranous sac that 
is coiled like a snail shell. The primary detection is ac- 
complished by specialized neurons inside the cochlea 
called hair cells (Figure 32.30). Each cochlea contains 
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FIGURE 32.31 An electron mícrograph 
of a hair bundie. [Courtesy of a. Jacobs 
and A. J. Hudspeth.] 
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FIGURE 32.32 Micromanipulation of 
a hair cell. Movement toward the tallest 
part of the bundie depolarizes the cell as 
measured by the microelectrode. Move- 
ment toward the shortest part hyper- 
polarizes the cell. Lateral movennent has 
no effect. [Adapted fronn Hudspeth, A. J. 
Nature 341 (1989) :397.] 



FIGURE 32.33 Electron mícrograph of 
tip links. The tip link between two hair 
fibers is marked by an arrow. [Courtesy of 
A. Jacobs and A. J. Hudspeth.] 




FIGURE 32.34 Model for hair-cell 
transduction. When the hair bundie is 
tipped toward the tallest part, the tip link 
pulls on and opens an ion channel. Move- 
ment in the opposite direction relaxes the 
tension in the tip link, increasing the 
probability that any open channeis will 
dose. [Adapted from A. J. Hudspeth. Noture 341 
(1989);397.] 


approximately 16,000 hair cells, and each hair cell contains a hexagonally 
shaped bundie of 20 to 300 hairlike projections called stereocilia (Figure 
32.31). These stereocilia are graded in length across the bundie. Mechani- 
cal deflection of the hair bundie, as occurs when a sound wave arrives at the 
ear, creates a change in the membrane potential of the hair cell. 

Micromanipulation experiments have directly probed the connection 
between mechanical stimulation and membrane potential. Displacement to¬ 
ward the direction of the tallest part of the hair bundie results in depolar- 
ization of the hair cell, whereas displacement in the opposite direction re¬ 
sults in hyperpolarization (Figure 32.32). Motion perpendicular to the 
hair-length gradient does not produce any change in resting potential. Re- 
markably, displacement of the hair bundie by as little as 3 A (0.3 nm) results 
in a measurable (and functionally important) change in membrane potential. 
This motion of 0.003 degree corresponds to a 1-inch movement of the top 
of the Empire State Building. 

How does the motion of the hair bundie create a change in membrane 
potential? The rapid response, within microseconds, suggests that the 
movement of the hair bundie acts on ion channeis directly. An important 
observation is that adjacent stereocilia are linked by individual filaments 
called tip links (Figure 32.33). 

The presence of these tip links suggests a simple mechanical model for 
transduction by hair cells (Figure 32.34). The tip links are coupled to ion 
channeis in the membranes of the stereocilia that are gated by mechanical 
stress. In the absence of a stimulus, approximately 15% of these channeis 
are open. When the hair bundie is displaced toward its tallest part, the stere¬ 
ocilia slide across one another and the tension on the tip links increases, 
causing additional channeis to open. The flow of ions through the newly 
opened channeis depolarizes the membrane. Conversely, if the displacement 
is in the opposite direction, the tension on the tip links decreases, the open 
channeis dose, and the membrane hyperpolarizes. Thus, the mechanical mo¬ 
tion of the hair bundie is directly converted into current flow across the hair- 
cell membrane. 








32.4.2 A Candidate Mechanosensory Channel 
Has Been Identified in Drosophila 

A likely ortholog of the transduction channel in hearing has been identified 
in fruit flies. Drosophila have sensory bristles used for detecting small air 
currents. These bristles respond to mechanical displacement in ways simi¬ 
lar to those of hair cells; displacement of a bristle in one direction leads to 
substantial transmembrane current. To isolate the gene encoding the trans¬ 
duction channel, investigators isolated 27 different strains of mutant fruit 
flies that showed uncoordinated motion and clumsiness and then examined 
their electrophysiological responses to displacement of the sensory bristles. 
In one set of strains, transmembrane currents were dramatically reduced. 
The mutated gene in these strains was found to encode a protein of 1619 
amino acids, called NompC for no mechanoreceptor potential. 

The carboxyl-terminal 469 amino acids of NompC resemble a class of 
ion channel proteins called TRP (transient receptor potential) channels. 
This region includes six putative transmembrane helices with a porelike re- 
gion between the fifth and sixth helices. The amino-terminal 1150 amino 
acids consist almost exclusively of 29 ankyrin repeats (Figure 32.35). 
Ankyrin repeats are structural motifs formed by 33 amino acids folded into 
a hairpin loop followed by a helix-turn-helix. Importantly, in other proteins, 
regions with tandem arrays of these motifs mediate protein-protein inter- 
actions. Although the role of the ankyrin repeats in NompC has not yet been 
established, they likely mediate interactions with other components to cou- 
ple mechanical motion to conformational changes in the channel. The C. 
elegans genome encodes an orthologous protein that also is expressed in 
mechanosensory neurons. Given the mechanistic parallels, homologous se- 
quences from the human genome are strong candidates for genes encoding 
the transduction channel in hearing. 



FIGURE 32.35 Ankyrin repeat structure. Four ankyrin repeats are shown with 
one shown in red. These domains interact with other proteins, primarily through their loops. 


32.5 TOUCH INCLUDES THE SENSING OF PRESSURE, 
TEMPERATURE, AND OTHER FACTORS 

Like taste, touch is a combination of sensory systems that are expressed in 
a common organ—in this case, the skin. The detection of pressure and the 
detection of temperature are two key components. Amiloride-sensitive 
sodium channels, homologous to those of taste, appear to play a role. Other 
Systems are responsible for detecting painful stimuli such as high tempera- 
ture, acid, or certain specific Chemicals. Although our understanding of this 
sensory system is not as advanced as that of the other sensory systems, re- 
cent Work has revealed a fascinating relation between pain and taste sensa- 
tion, a relation well known to anyone who has eaten ‘‘spicy’' food. 





-1 32-20 1- 

CHAPTER 32 • Sensory Systems 



FIGURE 32.36 The membrane topology 
deduced for VR1, the capsaídn 
receptor. The proposed site of the 
membrane pore is indicated in red, and 
the three ankyrin (A) repeats are shown in 
orange. The active receptor comprises four 
of these subunits. [Adapted from Caterina, 

M. J., Schumacher, M. A., Tominaga, M., Rosen, 

T. A., Levine, J. D., and Julius, D. Noture 389 
(1997):816.] 


32.5.1 Studies of Capsaicin, the Active Ingredient in "HoX" 
Peppers, Reveal a Receptor for Sensing High Temperatures 
and Other PainfuI Stimuli 

Our sense of touch is intimately connected with the sensation of pain. Spe- 
cialized neurons, termed nociceptors, transmit signals to pain-processing 
centers in the spinal cord and brain in response to the onset of tissue dam- 
age. What is the molecular basis for the sensation of pain? An intriguing 
clue carne from the realization that capsaicin, the chemical responsible for 
the ‘‘hot'' taste of spicy food, activates nociceptors. 



Capsaicin 


Early research suggested that capsaicin would act by opening ion chan- 
nels that are expressed in nociceptors. Thus, a cell that expresses the cap¬ 
saicin receptor should take up calcium on treatment with the molecule. This 
insight led to the isolation of the capsaicin receptor with the use of cDNA 
from cells expressing this receptor. Such cells had been detected by their 
fluorescence when loaded with the calcium-sensitive compound Fura-2 and 
then treated with capsaicin or related molecules. Cells expressing the cap¬ 
saicin receptor, which is called VRl (for fanilloid receptor 1), respond to 
capsaicin below a concentration of 1 jaM. The deduced 838-residue se- 
quence of VRl revealed it to be a member of the TRP channel family (Figure 

32.36) . The amino-terminal region of VRl includes three ankyrin repeats. 

Currents through VRl are also induced by temperatures above 40°C and 

by exposure to dilute acid, with a midpoint for activation at pH 5.4 (Figure 

32.37) . Temperatures and acidity in these ranges are associated with infec- 
tion and cell injury. The responses to capsaicin, temperature, and acidity are 
not independent. The response to heat is greater at lower pH, for example. 
Thus, VRl acts to integrate several noxious stimuli. We feel these responses 
as pain and act to avoid the potentially destructive conditions that caused the 
unpleasant sensation. Mice that do not express VRl suggest that this is the 
case; such mice do not mind food containing high concentrations of capsaicin 
and are, indeed, less responsive than control mice to normally noxious heat. 
Plants such as chili peppers presumably gained the ability to synthesize cap¬ 
saicin and other ‘Tot'' compounds to protect themselves from being con- 
sumed by mammals. Birds, which play the beneficiai role of spreading pep- 
per seeds into new territory, do not appear to respond to capsaicin. 


FIGURE 32.37 Response of the 
capsaicin receptor to pH and temper¬ 
ature. [Adapted from Tominaga, M., Caterina, 
M. J., Maimberg, A. B., Rosen, T. A., Gilbert, H., 
Skinner, K., Raumann, B. E., Basbaum, A. I., and 
Julius, D. Neuron 21 (1998):531.] 


































T Because of its ability to simulate VRl, capsaicin is used in pain man- 
agement for arthritis, neuralgia, and other neuropathies. How can 
a compound that induces pain assist in its alleviation? Chronic exposure to 
capsaicin overstimulates pain-transmitting neurons, leading to their desen- 
sitization. 

32.5.2 Subtie Sensory Systems Detect Other Environmental 
Factors Such as Earth's Magnetic Field 

In addition to the five primary senses, human beings may have counterparts 
to less-familiar sensory systems characterized in other organisms. These 
sensory systems respond to environmental factors other than light, molec¬ 
ular shape, or air motion. For example, some species of bactéria are mag- 
netotactic; that is, they move in directions dictated by Earth’s magnetic field 
(Figure 32.38). In the Northern Hemisphere, Earth s magnetic field points 
northward but also has a component directed downward, toward Earth s 
center. Magnetotactic bactéria not only swim northward but also swim 
downward, away from the surface and the presence of high leveis of oxy- 
gen, toxic to these bactéria. Remarkably, these bactéria synthesize intracel- 
lular chains of small particles containing a magnetic ore called magnetite 
(Fe 304 ) that run through the center of each bacterium. Such chains are 
called magnetosomes. The magnetic force exerted by these particles is suffi- 
ciently strong in relation to the size of the bacterium that it causes the bac¬ 
terium to become passively aligned with Earth s magnetic field. Intrigu- 
ingly, similar magnetite particles have been detected in the brains of birds, 
fish, and even human beings, although their role in sensing magnetic fields 
has not yet been established. 

There may exist other subtie senses that are able to detect environmen¬ 
tal signals that then influence our behavior. The biochemical basis of these 
senses is now under investigation. One such sense is our ability to respond, 
often without our awareness, to chemical signals called pheromones, re- 
leased by other persons. Another is our sense of time, manifested in our 
daily (circadian) rhythms of activity and restfulness. Daily changes in light 
exposure strongly influence these rhythms. The foundations for these senses 
have been uncovered in other organisms; future studies should reveal to 
what extent these mechanisms apply to human beings as well. 
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FIGURE 32.38 Magnetotactic 
bacterium. The magnetosome, visible 
as a Chain of opaque membrane-bound 
magnetite crystals, acts as a compass 
to orient the bactéria with the earth's 
magnetic field. The bacterium is artificially 
colored. [Courtesy of Richard B. Frankel, Califór¬ 
nia Polytechnic State University, San Luis Obispo, 
Califórnia.] 


SUMMARY 


Smell, taste, vision, hearing, and touch are based on signal-transduction 
pathways activated by signals from the environment. 

These sensory systems function similarly to the signal-transduction 
pathways for many hormones. These intercellular signaling pathways 
appear to have been appropriated and modified to process environ¬ 
mental Information. 

A Wide Variety of Organic Compounds Are Detected by Olfaction 

The sense of smell, or olfaction, is remarkable in its specificity—it can, 
for example, discern stereoisomers of small organic compounds as dis- 
tinct aromas. The 7 TM receptors that detect these odorants operate in 
conjunction with G(oif), a G protein that activates a cAMP cascade re- 
sulting in the opening of an ion channel and the generation of a nerve 
impulse. An outstanding feature of the olfactory system is its ability to 
detect a vast array of odorants. Each olfactory neuron expresses only 
one type of receptor and connects to a particular region of the olfactory 
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receptors are activated by more than one odorant. 

• Taste Is a Combination of Senses That Function by Different Mechanisms 

We can detect only five tastes: bitter, sweet, salt, sour, and umami. The 
transduction pathways that detect taste are, however, diverse. Bitter and 
sweet tastants are experienced through 7 TM receptors acting through 
a special G protein called gustducin. Salty and sour tastants act directly 
through membrane channels. Salt tastants are detected by passage 
though sodium channels, whereas sour taste results from the effects of 
hydrogen ions on a number of types of channels. The end point is the 
same in all cases—membrane polarization that results in the transmis- 
sion of a nerve impulse. Umami, the taste of glutamate, is detected by 
a receptor that is a modified form of a brain receptor that responds to 
glutamate as a neurotransmitter rather than as a tastant. 

• Photoreceptor Molecules in the Eye Detect Visible Light 

Vision is perhaps the best understood of the senses. Two classes of pho¬ 
toreceptor cells exist: cones, which respond to bright lights and colors, 
and rods, which respond only to dim light. The photoreceptor in rods 
is rhodopsin, a 7TM receptor that is a complex of the protein opsin and 
the chromophore 11-ds-retinal. Absorption of light by 11-ds-retinal 
changes its structure into that of all-trans-retinal, setting in motion a 
signal-transduction pathway that leads to the breakdown of cGMP, to 
membrane hyperpolarization, and to a subsequent nerve impulse. Color 
vision is mediated by three distinct 7 TM photoreceptors that employ 
11 -ds-retinal as a chromophore and absorb light in the blue, green, and 
red parts of the spectrum. 

• Hearing Depends on the Speedy Detection of Mechanical Stimuli 

The immediate receptors for hearing are found in the hair cells of the 
cochleae, which contain bundles of stereocilia. When the stereocilia 
move in response to sound waves, cation channels will open or dose, 
depending on the direction of movement. The mechanical motion of the 
cilia is converted into current flow and then into a nerve impulse. 

• Touch Includes the Sensing of Pressure, Temperature, and Other Factors 

Touch, detected by the skin, senses pressure, temperature, and pain. Spe- 
cialized nerve cells called nociceptors transmit signals that are interpreted 
in the brain as pain. A receptor responsible for the perception of pain has 
been isolated on the basis of its ability to bind capsaicin, the molecule re¬ 
sponsible for the hot taste of spicy food. The capsaicin receptor, also 
called VRl, functions as a cation channel that initiates a nerve impulse. 
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-1 PROBLEMS I- 

1. Of mice and rats. As noted in Section 32.1.2, one of the First 
odorant receptors to be matched with its ligand was a rat recep¬ 
tor that responded best to n-octanal. The sequence of the corre- 
sponding mouse receptor differed from the rat receptor at 15 po- 
sitions. Surprisingly, the mouse receptor was found to respond 
best to n-heptanal rather than n-octanal. The substitution of 
isoleucine at position 206 in the mouse for valine at this position 
in the rat receptor was found to be important in determining the 
specificity for n-heptanal. Propose an explanation. 

2. Olfaction in worms. Unlike the olfactory neurons in the mam¬ 
malian Systems discussed herein, olfactory neurons in the nem- 


atode C. elegans express multiple olfactory receptors. In partic¬ 
ular, one neuron (called AWA) expresses receptors for compounds 
to which the nematode is attracted, whereas a different neuron 
(called AWB) expresses receptors for compounds that the nem¬ 
atode avoids. Suppose that a transgenic nematode is generated 
such that one of the receptors for an attractant is expressed in 
AWB rather than AWA. What behavior would you expect in the 
presence of the corresponding attractant? 

3. Odorant matching. A mixture of two of the compounds 
illustrated in Figure 32.6 is applied to a section of olfactory 
epithelium. Only receptors 3, 5, 9, 12, and 13 are activated, 
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according to Figure 32.7. Identify the likely compounds in the 
mixture. 

4. Timing. Compare the aspects of taste (bitter, sweet, salty, 
sour) in regard to their potential for rapid time resolution. 

5. Two ears. Our ability to determine the direction from which a 
sound is Corning is partly based on the difference in time at which 
our two ears detect the sound. Given the speed of sound (350 me- 
ter/second) and the separation between our ears (0.15 meter), 
what difference is expected in the times at which a sound arrives 
at our two ears? How does this difference compare with the time 
resolution of the human hearing system? Would a sensory Sys¬ 
tem that utilized 7TM receptors and G proteins be capable of ad- 
equate time resolution? 

6. Constitutive mutants. What effect within the olfactory system 
would you expect for a mutant in which adenylate cyclase is al- 
ways fully active? What effect within the visual system would 
you expect for a mutant in which guanylate cyclase is alway s fully 
active? 

7. Bottle choice. A widely used method for quantitatively mon- 
itoring rodent behavior with regard to taste is the bottle-choice 
assay. An animal is placed in a cage with two water bottles, one 
of which contains a potential tastant. After a fixed period of time 
(24-48 hours), the amount of water remaining in each bottle is 
measured. Suppose that much less water remains in the bottle 
with the tastant after 48 hours. Do you suspect the tastant to be 
sweet or bitter? 

8. ft s better to he bitter. Some nontoxic plants taste very bitter 
to us. Suggest one or more explanations. 

9. Unexpected consequences. Sildenafil (Viagra) is a drug widely 
used to treat male impotence. Sildenafil exerts its effect by in- 
hibiting a cGMP phosphodiesterase isozyme (PDF5) that is es- 
pecially prevalent in smooth muscle. Interestingly, certain air- 
lines restrict pilots from flying for 24 hours after using sildenafil. 
Suggest a reason for this restriction. 


Chapter Integration Problem 

10. Energy and information. The transmission of sensory infor- 
mation requires the input of free energy. For each sensory sys¬ 
tem (olfaction, gustation, vision, hearing, and touch), identify 
mechanisms for the input of free energy that allow the trans¬ 
mission of sensory information. 

Mechanism Problem 

11. Schiff-base formation. Propose a mechanism for the reaction 
between opsin and 11-ds-retinal. 

Media Problems 

12. Homologous proteins, analogous binding? Odorants bind to 
7 TM receptors, but where they bind (and whether all bind in 
the same way) is unclear. Odorants might, for example, bind on 
the extracellular surface, or, like retinal, they might bind in the 
interior of the transmembrane region. Problem 1 of this chapter 
presents evidence for the direct involvement of residue 206 in 
odorant binding in receptors from mouse and rat. While these 
receptors’ structures are not known in detail, their sequences are 
similar enough to rhodopsin s that the rhodopsin structure can 
be used to infer the approximate location of residue 206. To see 
the likely location, look in the Structural Insights module on 
rhodopsin. Where do you think the mouse and rat receptors bind 
their odorants? 

13. Deodorant? A cAMP phosphodiesterase has been discovered 
that is found predominantly in olfactory sensory neurons (Yan 
et ah, 1995, Proc. Natl. Acad. Sei. 10:9677). The enzyme is ac- 
tivated by Ca^”^. What do you think this enzyme does, and why 
do you think it is regulated by calcium? (Hint: Study the re¬ 
sponse and recovery animations in the Conceptual Insights 
module on signaling pathways.) 


The Immune System 


Dedicated to the memory of Don 
Wiley, a pioneer in unraveling the 
structural basis of immune-system 
function 



Antibody 



Just as Medieval defenders used their weapons and the 
castie walis to defend their city, the immune system 
constantiy battles against foreign invaders such as viruses, 
bactéria, and parasites to defend the organism. Antibody 
molecules provide a key element in the immune system's defensive 
arsenal. For example, specific antibodies can bind to molecules on 
the surfaces of viruses and prevent the viruses from infecting cells. 
Above, an antibody binds to one subunit on hemagglutinin from 
the surface of influenza virus. [(Left) The Granger Collection.] 


33.0.1 The Immune System Adapts, Using the 
Principies of Evolution 


OUTLINE 


We are constantly exposed to an incredible diversity of bactéria, viruses, and 
parasites, many of which would flourish in our cells or extracellular fluids were 
it not for our immune system. Remarkably, we are often even able to defend 
ourselves against organisms that we have never before encountered. How does 
the immune system protect us? The key is our ability to produce more than 10^ 
distinct antibodies and more than 10^^ T-cell receptors, each of which presents a 
different surface for specifically binding a molecule from a for¬ 
eign organism and initiating the destruction of the invader. 

The presence of this remarkable repertoire of defensive 
molecules poses a challenge. What prevents the immune sys¬ 
tem from attacking cells that express molecules normally 
present in our bodies; that is, how does the immune system dis- 
tinguish between nonself and self? We shall examine these 
questions, focusing first on the structures of the proteins par- 
ticipating in the molecular recognition processes and then on 
the mechanisms for selecting cells that express molecules use- 
ful for protecting us from a specific pathogen. Emphasis will 
be on the modular construction of the proteins of the immune 
system—identifying structural motifs and considering how 
spectacular diversity can arise from modular construction. 


33.1 Antibodies Possess Distinct 
Antigen-Binding and Effector Units 

33.2 The Immunoglobulin Fold Consists 
of a Beta-Sandwich Framework 

with Hypervariable Loops 

33.3 Antibodies Bind Specific Molecules 
Through Their Hypervariable Loops 

33.4 Diversity Is Generated by Gene 
Rearrangements 

33.5 Major-Histocompatibility-Complex 
Proteins Present Peptide Antigens 

on Cell Surfaces for Recognition 
by T-Cell Receptors 


The immune system comprises two parallel but interrelated 
Systems. In the humoral immune response, soluble proteins 


o 33.6 Immune Responses Against 
Self-Antigens Are Suppressed 


CHAPTER 53 
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FIGURE 33.1 Immunoglobulín 
productíon. An electron micrograph of a 
plasma cell shows the highly developed 
rough endoplasmic reticuium necessary for 
antibody secretion. [Courtesy of Lynne 
Mercer.] 


called antibodies (immunoglobulins) function as recognition elements that 
bind to foreign molecules and serve as markers signaling foreign invasion 
(Figure 33.1). Antibodies are secreted by plasma cells, which are derived 
from B lymphocytes (B cells). A foreign macromolecule that binds selectively 
to an antibody is called an antigen. In a physiological context, if the bind- 
ing of the foreign molecule stimulates an immune response, that molecule 
is called an immunogen. The specific affinity of an antibody is not for the 
entire macromolecular antigen but for a particular site on the antigen called 
the epitope or antigenic determinant. 

In the cellular immune response, cells called cytotoxic T lymphocytes (also 
commonly called killer T cells) kill cells that display foreign motifs on their 
surfaces. Another class of T cells called helper T lymphocytes contributes to 
both the humoral and the cellular immune responses by stimulating the dif- 
ferentiation and proliferation of appropriate B cells and cytotoxic T cells. 
The celluar immune response is mediated by specific receptors that are ex- 
pressed on the surfaces of the T cells. 

The remarkable ability of the immune system to adapt to an essentially 
limitless set of potential pathogens requires a powerful system for trans- 
forming the immune cells and molecules present in our systems in response 
to the presence of pathogens. This adaptive system operates through the prin¬ 
cipies of evolution, including reproduction with variation followed by selection 
of the most well suited members of a population. 

If the human genome contains, by the latest estimates, only 40,000 
genes, how can the immune system generate more than 10^ different anti¬ 
body proteins and 10^^ T-cell receptors? The answer is found in a novel 
mechanism for generating a highly diverse set of genes from a limited set of 
genetic building blocks. Linking different sets of DNA regions in a combi- 
natorial manner produces many distinct protein-encoding genes that are not 
present in the genome. A rigorous selection process then leaves for prolif¬ 
eration only cells that synthesize proteins determined to be useful in the im¬ 
mune response. The subsequent reproduction of these cells without addi- 
tional recombination serves to enrich the cell population with members 
expressing a particular protein species. 

Criticai to the development of the immune response is the selection 
process, which determines which cells will reproduce. The process comprises 
several stages. In the early stages of the development of an immune response, 
cells expressing molecules that bind tightly to self-molecules are destroyed 
or silenced, whereas cells expressing molecules that do not bind strongly to 
self-molecules and that have the potential for binding strongly to foreign mol¬ 
ecules are preserved. The appearance of an immunogenic invader at a later 
time will stimulate cells expressing immunoglobulins or T-cell receptors that 
bind specifically to elements of that pathogen to reproduce—in evolutionary 
terms, such cells are selected for. Thus, the immune response is based on the 
selection of cells expressing molecules that are specifically effective against a 
particular invader; the response evolves from a population with wide-ranging 
specificities to a more-focused collection of cells and molecules that are well 
suited to defend the host when confronted with that particular challenge. 


33.1 ANTIBODIES POSSESS DISTINCT ANTIGEN-BINDING 
AND EFFECTOR UNITS 

Antibodies are central molecular players in the immune response, and we 
examine them first. A fruitful approach in studying proteins as large as an¬ 
tibodies is to split the protein into fragments that retain activity. In 1959, 






FIGURE 33.2 Immunoglobulín G structure. (A) The three-dimensiondl structure 
of an IgG molecule showing the light chains in yellow and the heavy chains in blue. (B) A 
schematic view of an IgG molecule indicating the positions of the interchain disulfide 
bonds. N, amino terminus; C, carboxyl terminus. 


Rodney Porter showed that immunoglobulin G (IgG), the major antibody in 
serum, can be cleaved into three 50-kd fragments by the limited proteolytic 
action of papain. Two of these fragments bind antigen. They are called Fah 
(F stands for fragment, ah for antigen binding). The other fragment, called 
Fc because it crystallizes readily, does not bind antigen, but it has other im- 
portant biological activities, including the mediation of responses termed 
effector functions. These functions include the initiation of the complement 
Cascade, a process that leads to the lysis of target cells. Although such ef¬ 
fector functions are crucial to the functioning of the immune system, they 
will not be discussed further here. 

How do these fragments relate to the three-dimensional structure 
of whole IgG molecules? Immunoglobulin G consists of two kinds of 
polypeptide chains, a 25-kd light (L) chain and a 50-kd heavy (H) chain 
(Figure 33.2). The subunit composition is L2H2. Each L chain is linked to 
an H chain by a disulfide bond, and the H chains are linked to each other 
by at least one disulfide bond. Examination of the amino acid sequences 
and three-dimensional structures of IgG molecules reveals that each L 
chain comprises two homologous domains, termed immunoglobulin do- 
mains, to be described in detail in Section 33.2. Each H chain has four im¬ 
munoglobulin domains. Overall, the molecule adopts a conformation that 
resembles the letter Y, in which the stem, corresponding to the Fc fragment 
obtained by cleavage with papain, consists of the two carboxyl-terminal 
immunoglobulin domains of each H chain and in which the two arms of 
the Y, corresponding to the two F^b fragments, are formed by the two 
amino-terminal domains of each H chain and the two amino-terminal 
domains of each L chain. The linkers between the stem and the two arms 
consist of relatively extended polypeptide regions within the H chains and 
are quite flexible. 

Papain cleaves the H chains on the carboxyl-terminal side of the disul¬ 
fide bond that links each L and H chain (Figure 33.3). Thus, each F^b con¬ 
sists of an entire L chain and the amino-terminal half of an H chain, whereas 
Fc consists of the carboxyl-terminal halves of both H chains. Each Fab con- 
tains a single antigen-binding site. Because an intact IgG molecule contains 
two Fab components and therefore has two binding sites, it can cross-link 



Papain 

cleavage 

N/ 



Fc 


FIGURE 33.3 Immunoglobulin G 
cleavage. Treatment of intact IgG 
molecules with the protease papain 
resuits in the formation of three large 
fragments: two Fab fragments that retain 
antigen-binding capability and one Fc 
fragment that does not. 



- ^ 924 | - 

CHAPTER 33 • The Imnnune System 



FIGURE 33.4 Antígen cross-línking. 

Because IgG molecules include two 
antigen-binding sites, antibodies can cross- 
link multivalent antigens such as virai 
surfaces. 


multiple antigens (Figure 33.4). Furthermore, the Fc and the two Fab units 
of the intact IgG are joined by flexible polypeptide regions that allow facile 
variation in the angle between the Fab units through a wide range (Figure 
33.5). This kind of mobility, called segmentai flexibility, can enhance the for- 
mation of an antibody-antigen complex by enabling both combining sites 
on an antibody to bind an antigen that possesses multiple binding sites, such 
as a virai coat composed of repeating identical monomers or a bacterial cell 
surface. The combining sites at the tips of the Fab units simply move to 
match the distance between specific determinants on the antigen. 



FIGURE 33.5 Segmentai flexibilíty. The linkages between the Fab and the Fc regions of 
an IgG molecule are flexible, allowing the two antigen-binding sites to adopt a range of 
orientations with respect to one another. This flexibility allows effective interactions with a 
multivalent antigen without requiring that the epitopes on the target be a precise distance 
apart. 


Immunoglobulin G is the antibody present in highest concentration in 
the serum, but other classes of immunoglobulin also are present (Table 
33.1). Each class includes an L chain (either k or X) and a distinct H chain 
(Figure 33.6). The heavy chains in IgG are called y chains, whereas those 
in immunoglobulins A, M, D, and E are called a, jui, 8 , and e, respectively. 
Immunoglobulin M (IgM) is the first class of antibody to appear in the serum 
after exposure to an antigen. The presence of 10 combining sites enables 
IgM to bind especially tightly to antigens containing multiple identical epi¬ 
topes. The strength of an interaction comprising multiple independent 
binding interactions between partners is termed avidity rather than affinity, 
which denotes the binding strength of a single combining site. The pres¬ 
ence of 10 combining sites in IgM compared with 2 sites in IgG enables 
IgM to bind many multivalent antigens that would slip away from IgG. 

Immunoglobulin A (IgA) is the major class of antibody in externai secre- 
tions, such as saliva, tears, bronchial mucus, and intestinal mucus. Thus, 
IgA serves as a first line of defense against bacterial and virai antigens. The 
role of immunoglobulin D (IgD) is not yet known. Immunoglobulin E (IgE) is 


TABLE 33.1 Properties of immunoglobulin classes 


Class 

Serum 

concentration 

(mg/ml) 

Mass 

(kd) 

Scdimcntation 

cocfficicnt(s) 

Light 

chains 

Heavy 

chains 

Chain 

structurc 

IgG 

12 

150 

7 

K or X 

7 

K272 or X 272 

IgA 

3 

180-500 

7, 10, 13 

K or X 

a 

{K 2 a 2 )n or (X 20 i 2 )n 

IgM 

1 

950 

18-20 

K or X 


{k 2 H. 2 )s or (A.2jr2)5 

IgD 

0.1 

175 

7 

K or X 

8 

K 282 or X 282 

IgE 

0.001 

200 

8 

K or X 

e 

K 2 e 2 or X 2^2 


Note: n = 1, 2, or 3. IgM and oligomers of IgA also contain J chains that connect immunoglobulin 
molecules. IgA in secretions has an additional component. 










important in conferring protection against parasites, but IgE also causes 
allergic reactions. IgE-antigen complexes form cross-links with receptors 
on the surfaces of mast cells to trigger a cascade that leads to the release of 
granules containing pharmacologically active molecules. Histamine, one of 
the agents released, induces smooth muscle contraction and stimulates the 
secretion of mucus. 

A comparison of the amino acid sequences of different IgG antibodies 
from human beings or mice shows that the carboxyl-terminal half of the L 
chains and the carboxyl-terminal three-quarters of the H chains are very 
similar in all of the antibodies. Importantly, the amino-terminal domain of 
each chain is more variable, including three stretches of approximately 7 to 
12 amino acids within each chain that are hypervariable, as shown for the 
H chain in Figure 33.7. The amino-terminal immunglobulin domain of each 
chain is thus referred to as the variable region, whereas the remaining im- 
munoglobulin domains are much more similar in all antibodies and are re¬ 
ferred to as constant regions (Figure 33.8). 


IgD IgE 




FIGURE 33.6 Classes of ímmuno- 
globulin. Each of five classes of imnnuno- 
globulin has the same light chain (shown 
in yellow) combined with a different 
heavy chain ( 7 , a, |jl, 8, or e). Disulfide 
bonds are indicated by green lines. The 
IgA dimer and the IgM pentamer have 
a small polypeptide chain in addition to 
the light and heavy chains. 



Residue 

FIGURE 33.7 Immunoglobulin sequence diversity. A plot of sequence variability as a 
function of position along the sequence of the amino-terminal immunglobulin domain of 
the H chain of human IgG molecules. Three regions (in red) show remarkably high leveis of 
variability. These hypervariable regions correspond to three loops in the immunoglobulin 
domain structure. [After R. A. Goldsby, T. J. Kindt, and B. A. Osborne, Kuby Immunology, 4th ed. 

(W. H. Freennan and Company, 2000), p. 91.] 
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FIGURE 33.8 Variable and constant 
regions. Each L and H chain includes 
one immunoglobulin domain at its amino 
terminus that is quite variable from one 
antibody to another. These domains are 
referred to as Vl and Vh. The remaining 
domains are more constant from one 
antibody to another and are referred to as 
constant domains (Ql, ChI, Ch 2, and Ch3). 
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33.2 THE IMMUNOGLOBULIN FOLD CONSISTS 
OF A BETA-SANDWICH FRAMEWORK 
WITH HYPERVARIABLE LOOPS 


FIGURE 33.9 Immunoglobulin 
fold. An immunoglobulin domain consists 
of a pair of p-sheets linked by a disulfide 
bond and hydrophobic interactions. Three 
hypen/ariable loops lie at one end of the 
structure. 


An IgG molecule consists of a total of 12 immunoglobulin domains. These 
domains have many sequence features in common and adopt a common 
structure, the immunoglobulin fold (Figure 33.9). Remarkably, this same 
structural domain is found in many other proteins that play key roles in the 
immune system. 



The immunoglobulin fold consists of a pair of |3 sheets, each built of 
antiparallel (3 strands, that surround a central hydrophobic core. A single 
disulfide bond bridges the two sheets. Two aspects of this structure are par- 
ticularly important for its function. First, three loops present at one end of 
the structure form a potential binding surface. These loops contain the 
hypervariable sequences present in antibodies and in T-cell receptors (see 
Sections 33.3 and 33.5.2). Variation of the amino acid sequences of these 
loops provides the major mechanism for the generation of the vastly diverse 
set of antibodies and T-cell receptors expressed by the immune system. 
These loops are referred to as hypervariable loops or complementarity- 
determining regions (CDRs). Second, the amino terminus and the carboxyl 
terminus are at opposite ends of the structure, which allows structural do¬ 
mains to be strung together to form chains, as in the L and H chains of 
antibodies. Such chains are present in several other key molecules in the 
immune system. 

Y The immunoglobulin fold is one of the most prevalent domains en- 
coded by the human genome— more than 750 genes encode proteins 
with at least one immunoglobulin fold recognizable at the levei of amino 
acid sequence. Such domains are also common in other multicellular ani¬ 
mais such as flies and nemotodes. However, from inspection of amino acid 
sequence alone, immunoglobulin-fold domains do not appear to be present 
in yeast or plants. However, structurally similar domains are present in these 
organisms, including the key photosynthetic electron-transport protein 
plastocyanin in plants (Section 19.3.2). Thus, the immunglobulin-fold 
family appears to have expanded greatly along evolutionary branches lead- 
ing to animais—particularly, vertebrates. 
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33.3 ANTIBODIES BIND SPECIFIC MOLECULES 
THROUGH THEIR HYPERVARIABLE LOOPS 

For each class of antibody, the amino-terminal immunoglobin domains of 
the L and H chains (the variable domains, designated Vl and Vh) come to- 
gether at the ends of the arms extending from the structure. The positions 
of the complementarity-determining regions are striking. These hypervari- 
able sequences, present in three loops of each domain, come together so that 
all six loops form a single surface at the end of each arm (Figure 33.10). 
Because virtually any Vl can pair with any Vh, ü. very large number of dif- 
ferent binding sites can be constructed by their combinatorial association. 


Antibody Binding 



Side view End-on view 




FIGURE 33.10 Variable domains. 

Two views of the variable domains of the 
L Chain (yellow) and the H chain (blue); 
the complementarity-determining regions 
(CDRs) are shown in red. The six CDRs 
come together to form a binding surface. 
The specificity of the surface is determined 
by the sequences and structures of the 
CDRs. 


33.3.1 X-Ray Analyses Have Revealed How Antibodies 
Bind Antigens 

The results of x-ray crystallographic studies of many large and small anti¬ 
gens bound to Fab molecules have been sources of much insight into the 
structural basis of antibody specificity. The binding of antigens to antibod¬ 
ies is governed by the same principies that govern the binding of substrates 
to enzymes. The apposition of complementary shapes results in numerous 
contacts between amino acids at the binding surfaces of both molecules. 
Numerous hydrogen bonds, electrostatic interactions, and van der Waals 
interactions, reinforced by hydrophobic interactions, combine to give spe- 
cific and strong binding. 

A few aspects of antibody binding merit specific attention, inasmuch as 
they relate directly to the structure of immunoglobulins. The binding site 
on the antibody has been found to incorporate some or all of the CDRs in 
the variable domains of the antibody. Small molecules (e.g., octapeptides) 
are likely to make contact with fewer CDRs, with perhaps 15 residues of 
the antibody participating in the binding interaction. Macromolecules of- 
ten make more extensive contact, interacting with all six CDRs and 20 or 
more residues of the antibody. Small molecules often bind in a cleft of the 
antigen-binding region. Macromolecules such as globular proteins tend to 
interact across larger, fairly flat apposed surfaces bearing complementary 
protrusions and depressions. 

A well-studied case of small-molecule binding is seen in an example 
of phosphorylcholine bound to Fab- Crystallographic analysis revealed 









Phosphorylcholine 
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FIGURE 33.11 Bindíngofa 
small antígen. The structure of a complex 
between an Fgb fragment of an antibody 
and its target-in this case, phosphoryl¬ 
choline. Residues from the antibody 
interact with phosphorylcholine through 
hydrogen bonding and electrostatic and 
van der Waals interactions. 



FIGURE 33.12 Antíbodíes agaínst lysozyme. (A) The structures of 
three complexes (i, ii, iii) between Fab fragments (blue and yellow) and hen 
egg-white lysozyme (red) shown with lysozyme in the same orientation in 
each case. The three antibodies recognize completely different epitopes on 
the lysozyme molecule. (B) The Fab fragments from part A with points of 
contact highlighted as space-filling models, revealing the different 
shapes of the antigen-binding sites. 




(A) 












phosphorylcholine bound to a cavity lined by residues from five CD Rs— I 929 | 

two from the L chain and three from the H chain (Figure 33.11). The pos- Generation of Antibody Diversity 

itively charged trimethylammonium group of phosphorylcholine is buried 

inside the wedge-shaped cavity, where it interacts electrostatically with two 

negatively charged glutamate residues. The negatively charged phosphate 

group of phosphorylcholine binds to the positively charged guanidinium 

group of an arginine residue at the mouth of the crevice and to a nearby ly- 

sine residue. The phosphate group is also hydrogen bonded to the hydroxyl 

group of a tyrosine residue and to the guanidinium group of the arginine 

side chain. Numerous van der Waals interactions, such as those made by a 

tryptophan side chain, also stabilize this complex. 

The binding of phosphorylcholine does not significantly change the 
structure of the antibody, yet induced fit plays a role in the formation of 
many antibody-antigen complexes. A malleable binding site can accom- 
modate many more kinds of ligands than can a rigid one. Thus, induced fit 
increases the repertoire of antibody specificities. 


33.3.2 Large Antigens Bind Antibodies with Numerous 
Interactions 

How do large antigens interact with antibodies? A large collection of anti¬ 
bodies raised against hen egg-white lysozyme has been structurally charac- 
terized in great detail (Figure 33.12). Each different antibody binds to a dis- 
tinct surface of lysozyme. Let us examine the interactions present in one of 
these complexes in detail. This antibody binds two polypeptide segments 
that are widely separated in the primary structure, residues 18 to 27 and 116 
to 129 (Figure 33.13). 



FIGURE 33.13 Antibody-proteín 
interactions. The structure of a complex 
between an Fab fragment and lysozyme 
reveals that the binding surfaces are 
complementary in shape over a large area. 
A single residue of lysozyme, glutamine 
121, penetrates more deeply into the 
antibody combining site. 


All six CDRs of the antibody make contact with this epitope. The region 
of contact is quite extensive (about 30 X 20 À). The apposed surfaces are 
rather flat. The only exception is the side chain of glutamine 121 of 
lysozyme, which penetrates deeply into the antibody binding site, where it 
forms a hydrogen bond with a main-chain carbonyl oxygen atom and is sur- 
rounded by three aromatic side chains. The formation of 12 hydrogen bonds 
and numerous van der Waals interactions contributes to the high affinity 
(fCd — 20 nM) of this antibody-antigen interaction. Examination of the F^b 
molecule without bound protein reveals that the structures of the Vl and 
Vh domains change little on binding, although they slide 1 A apart to al- 
low more intimate contact with lysozyme. 


33.4 DIVERSITY IS GENERATED BY GENE 
REARRANGEMENTS 

A mammal such as a mouse or a human being can synthesize large amounts 
of specific antibody against virtually any foreign determinant within a mat- 
ter of days of being exposed to it. We have seen that antibody specificity is 
determined by the amino acid sequences of the variable regions of both light 




and heavy chains, which brings us to the key question: How are different 
variable-region sequences generated? 

The discovery of distinct variable and constant regions in the L and H 
chains raised the possibility that the genes that encode immunoglobulins 
have an unusual architecture that facilitates the generation of a diverse set 
of polypeptide products. In 1965, William Dreyer and Claude Bennett pro- 
posed that multiple V (variable) genes are separate from a single C (constant) 
gene in embryonic (germ-line) DNA. According to their model, one of these 
V genes becomes joined to the C gene in the course of differentiation of the 
antibody-producing cell. A criticai test of this novel hypothesis had to await 
the isolation of pure immunoglobulin mRNA and the development of tech- 
niques for analyzing mammalian genomes. Twenty years later, Susumu 
Tonegawa found that V and C genes are indeed far apart in embryonic DNA 
but are closely associated in the DNA of antibody-producing cells. Thus, 
immunoglobulin genes are rearranged in the differentiation of lymphocytes. 

33.4.1 J (Joining) Genes and D (Diversity) Genes Increase 
Antibody Diversity 

Sequencing studies carried out by Susumu Tonegawa, Philip Leder, and 
Leroy Hood revealed that V genes in embryonic cells do not encode the en- 
tire variable region of L and H chains. Consider, for example, the region 
that encodes the k light-chain family. A tandem array of 40 segments, each 
of which encodes approximately the first 97 residues of the variable domain 
of the L chain, is present on human chromosome 2 (Figure 33.14). 

11 rr 11 JihbMs 


FIGURE 33.14 The k light-chain locus. This part of human chromosome 2 includes an 
array of 40 segments that encode the variable (V) region (approximately residues 1 -97) 
of the light chain, an array of 5 segments that encode the joining (J) region (residues 
98-110), and a single region that encodes the constant (C) region. 


However, the variable region of the L chain extends to residue 110. Where 
is the DNA that encodes the last 13 residues of the V region? For L chains 
in undifferentiated cells, this stretch of DNA is located in an unexpected 
place: near the C gene. It is called the J gene because it joins the V and C 
genes in a differentiated cell. In fact, a tandem array of five J genes is lo¬ 
cated near the C gene in embryonic cells. In the differentiation of an anti¬ 
body-producing cell, a V gene becomes spliced to a J gene to form a com¬ 
plete gene for the variable region (Figure 33.15). RNA splicing generates 



FIGURE 33.15 VJ recombinatíon. A single V gene (in this case, V2) is linked to a J gene 
(here, J4) to form an intact VJ region. The intervening DNA is released in a circular form. 
Because the V and J regions are selected at random and the joint between them is not 
always in exactiy the same place, many VJ combinations can be generated by this process. 














































an mRNA molecule for the complete L chain by linking the coding regions 
for the rearranged VJ unit with that for the C unit (Figure 33.16). 

J genes are important contributors to antibody diversity because they 
encode part of the last hypervariable segment (CDR3). In forming a con- 
tinuous variable-region gene, any of the 40 V genes can become linked to 
any of five J genes. Thus, somatic recombination of these gene segments 
amplifies the diversity already present in the germ line. The linkage between 
V and J is not precisely controlled. Recombination between these genes can 
take place at one of several bases near the codon for residue 95, generating 
additional diversity. A similar array of V and J genes encoding the A light 
chain is present on human chromosome 22. This region includes 30 gene 
segments and four J;^ segments. In addition, this region includes four dis- 
tinct C genes, in contrast with the single C gene in the k locus. 

In human beings, the genes encoding the heavy chain are present on chro¬ 
mosome 14. Remarkably, the variable domain of heavy chains is assembled 
from three rather than two segments. In addition to Vh genes that encode 
residues 1 to 94 and Jh segments that encode residues 98 to 113, this chro- 
mosomal region includes a distinct set of segments that encode residues 95 
to 97 (Figure 33.17). These gene segments are called D for diversity. Some 
27 D segments lie between 51 Vh and 6 Jh segments. The recombination 
process first joins a D segment to a Jh segment; a Vh segment is then joined 
to DJh- a greater variety of antigen-binding patches and clefts can be formed 
by the H chain than by the L chain because the H chain is encoded by three 
rather than two gene segments. Moreover, CDR3 of the H chain is diversi- 
fied by the action of terminal deoxyribonucleotidyl transferase, a special 
DNA polymerase that requires no template. This enzyme inserts extra nu- 
cleotides between Vh and D. The V{D)J recombination of both the L and the 
H chains is executed by specific enzymes present in immune cells. These 
proteins, called RAG-1 and RAG-2, recognize specific DNA sequences 
called recombination signal sequences (RSSs) adjacent to the V, D, and J seg¬ 
ments and facilitate the cleavage and religation of the DNA segments. 
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FIGURE 33.16 Light-chaín expressíon. 

The light-chain protein is expressed by 
transcription of the rearranged gene to 
produce a pre-RNA molecule with the VJ 
and C regions separated. RNA splicing 
removes the intervening sequences to 
produce an mRNA molecule with the VJ 
and C regions linked. Translation of the 
mRNA and processing of the initial protein 
product produces the light chain. 
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FIGURE 33.17 V(D)J recombination. The heavy-chain locus includes an array of 51 V 
segments, 27 D segments, and 6 J segments. Gene rearrangement begins with D-J joining, 
followed by further rearrangement to link the V segment to the DJ segment. 


33.4.2 More Than 10^ Antibodies Can Be Formed 
by Combinatorial Association and Somatic Mutation 

Let us recapitulate the sources of antibody diversity. The germ line contains 
a rather large repertoire of variable-region genes. For k light chains, there 
are about 40 V-segment genes and five J-segment genes. Hence, a 
total of 40 X 5 = 200 kindsofcompleteVK genes can be formed by the com- 
binations of V and J. A similar analysis suggests that at least 120 different 
A light chains can be generated. A larger number of heavy-chain genes can 
be formed because of the role of the D segments. For 51 V, 27 D, and 6 J 
gene segments, the number of complete Vh genes that can be formed is 8262. 
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FIGURE 33.18 B-cell receptor. This 
complex consists of a membrane-bound 
IgM molecule noncovalentiy bound to two 
Ig-a-lg-p heterodimers. The intracellular 
domains of each of the Ig-a and Ig-p 
chains include an immunoreceptor 
tyrosine-based activation motif (ITAM). 



Kinase 

domain 


Phosporylates targets including an inhibitor 
of the transcription factor NF-kB 


FIGURE 33.19 B-cell activation. The binding of multivalent 
antigen such as bacterial or virai surfaces links membrane-bound 
IgM molecules. This oligomerization triggers the phosphorylation 
of tyrosine residues in the ITAM sequences by protein tyrosine 
kinases such as Lyn. After phosphorylation, the ITAMs serve as 
docking sites for Syk, a protein kinase that phosphorylates a 
number of targets, including transcription factors. 


The association of 320 kinds of L chains with 8262 kinds of H chains would 
yield 2.6 X 10^ different antibodies. Variability in the exact points of seg- 
ment joining and other mechanisms increases this value by at least two or- 
ders of magnitude. 

Even more diversity is introduced into antibody chains by somatic 
mutation —that is, the introduction of mutations into the recombined genes. 
In fact, a 1000-fold increase in binding affinity is seen in the course of a 
typical humoral immune response, arising from somatic mutation, a process 
called affinity maturation. The generation of an expanded repertoire leads to 
the selection of antibodies that more precisely fit the antigen. Thus, nature 
draws on each of three sources of diversity—a germ-line repertoire, somatic 
recombination, and somatic mutation—to form the rich variety of anti¬ 
bodies that protect an organism from foreign incursions. 


33.4.3 The Oligomerization of Antibodies Expressed on the 
Surface of Immature B Cells Triggers Antibody Secretion 

The processes heretofore described generate a highly diverse set of antibody 
molecules—a key first step in the generation of an immune response. The 
next stage is the selection of a particular set of antibodies directed against a 
specific invader. How does this selection occur? Each immature B cell, pro- 
duced in the bone marrow, expresses a monomeric form of IgM attached 
to its surface (Figure 33.18). Each cell expresses approximately 10^ IgM 
molecules, but all of these molecules are identical in amino acid sequence and, 
hence, in antigen-binding specificity. Thus, the selection of a particular im¬ 
mature B cell for growth will lead to the amplification of an antibody with 
a unique specificity. The selection process begins with the binding of an 
antigen to the membrane-bound antibody. 

Associated with each membrane-linked IgM molecule are two molecules 
of a heterodimeric membrane protein called Ig-a-Ig-|3 (see Figure 33.18). 
Examination of the amino acid sequences of Ig-a and Ig-(3 is highly in- 
structive. The amino terminus of each protein lies outside the cell and cor- 
responds to a single immunoglobulin, and the carboxyl 
terminus, which lies inside the cell, includes a sequence 
of 18 amino acids called an immunoreceptor tyrosine-based 
activation motif (ITAM) (see Figure 33.18). As its name 
suggests, each ITAM includes key tyrosine residues, 
which are subject to phosphorylation by particular pro¬ 
tein kinases present in immune-system cells. 

A fundamental observation with regard to the mech- 
anism by which the binding of antigen to membrane- 
bound antibody triggers the subsequent steps of the im¬ 
mune response is that oligomerization or clustering of the 
antibody molecules is required (Figure 33.19). The re- 
quirement for oligomerization is reminiscent of the 
dimerization of receptors triggered by growth hormone 
and epidermal growth factor encountered in Sections 15.4 
and 15.4.1; indeed, the associated signaling mechanisms 
appear to be quite similar. The oligomerization of the 
membrane-bound antibodies results in the phosphoryla¬ 
tion of the tyrosine residues within the ITAMs by pro¬ 
tein tyrosine kinases including Lyn, a homolog of Src 
(Section 15.5). The phosphorylated ITAMs serve as 
docking sites for a protein kinase termed spleen tyrosine 
feinase (Syk), which has two SH2 domains that interact 
with the pair of phosphorylated tyrosines in each ITAM. 











Syk, when activated by phosphorylation, proceeds to phosphorylate other 
signal-transduction proteins including an inhibitory subunit of a transcrip- 
tion factor called NF-kB and an isoform of phospholipase C. The signaling 
processes continue downstream to activate gene expression, leading to the 
stimulation of cell growth and initiating further B-cell differentiation. 

T Drugs that modulate the immune system have served as sources, 
of insight into immune-system signaling pathways. For example, 
cyclosporin, a powerful suppressor of the immune system, acts by blocking 
a phosphatase called calcineurin, which normally activates a transcription 
factor called NF-AT by dephosphorylating it. 



The potent immune supression that results reveals how crucial the activity 
of this transcription factor is to the development of an immune response. 
Without drugs such as cyclosporin, organ transplantation would be ex- 
tremely difficult because transplanted tissue expresses a wide range of for- 
eign antigens, which causes the immune system to reject the new tissue. 

The role of oligomerization in the B-cell signaling pathway is illuminated 
when we consider the nature of many antigens presented by pathogens. The 
surfaces of many viruses, bactéria, and parasites are characterized by arrays 
of identical membrane proteins or membrane-linked carbohydrates. Thus, 
most pathogens present multiple binding surfaces that will naturally cause 
membrane-associated antibodies to oligomerize as they bind adjacent epi- 
topes. In addition, the mechanism accounts for the observation that most 
small molecules do not induce an immune response; however, coupling mul¬ 
tiple copies of the small molecule to a large oligomeric protein such as key- 
hole limpet hemocyanin (KLH), which has a molecular mass of dose to 
1 million daltons or more, promotes antibody oligomerization and, hence, 
the production of antibodies against the small-molecule epitope. The large 
protein is called the carrier of the attached chemical group, which is called a 
haptenic determinant. The small foreign molecule by itself is called a hapten. 
Antibodies elicited by attached haptens will bind unattached haptens as well. 

33.4.4 Different Classes of Antibodies Are Formed by the 
Hopping of Vh Genes 

The development of an effective antibody-based immune response depends 
on the secretion into the blood of antibodies that have appropriate effector 
functions. At the beginning of this response, an alternative mRNA splicing 
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FIGURE 33.20 Class swítchíng. Further 
rearrangement of the heavy-chain locus 
resuits in the generation of genes for anti- 
body classes other than IgM. In the case 
shown, rearrangement places the VDJ 
region next to the C 7 I region, resulting 
in the production of IgGl. Note that no 
further rearrangement of the VDJ region 
takes place, so the specificity of the anti- 
body is not affected. 


pathway is activated so that the production of membrane-linked IgM is sup- 
planted by the synthesis of secreted IgM. As noted in Section 33.1, secreted 
IgM is pentameric and has a relatively high avidity for multivalent antigens. 
Later, the antibody-producing cell makes either IgG, IgA, IgD, or IgE of 
the same specificity as the intially secreted IgM. In this switch, the light 
chain is unchanged, as is the variable region of the heavy chain. Only the 
constant region of the heavy chain changes. This step in the differentiation 
of an antibody-producing cell is called class switching (Figure 33.20). In un- 
differentiated cells, the genes for the constant region of each class of heavy 
chain, called Cjj^, C 5 , and 0^ are next to each other. There are eight 

in all, including four genes for the constant regions of 7 chains. A complete 
gene for the heavy chains of IgM antibody is formed by the translocation 
of a Vh gene segment to a DJh gene segment. 

How are other heavy chains formed? Class switching is mediated by a 
gene-rearrangement process that moves a VDJ gene from a site near one C 
gene to a site near another C gene. Importantly, the antigen-binding speci¬ 
ficity is conserved in class switching because the entire V^DJ^ gene is translo- 
cated in an intact form. For example, the antigen-combining specificity of 
IgA produced by a particular cell is the same as that of IgM synthesized at 
an earlier stage of its development. The biological significance of Ch switch¬ 
ing is that a whole recognition domain (the variable domain) is shifted from 
the early constant region (C,jl) to one of several other constant regions that 
mediate different effector functions. 


33.5 MAJOR-HISTOCOIVIPATIBILITY-COIVIPLEX PROTEINS 
PRESENT PEPTIDE ANTIGENS ON CELL SURFACES 
FOR RECOGNITION BY T-CELL RECEPTORS 

Soluble antibodies are highly effective against extracellular pathogens, but 
they confer little protection against microorganisms that are predominantly 
intracellular, such as viruses and mycobacteria (which cause tuberculosis and 
leprosy). These pathogens are shielded from antibodies by the host-cell 
membrane (Figure 33.21). A different and more subtle strategy, cell-mediated 


FIGURE 33.21 Intracellular pathogen. 

An electron micrograph showing 
mycobacteria (arrows) inside an infected 
macrophage. [Courtesy of Dr. Stanley Falkow.] 
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FIGURE 33.22 Presentation of peptides 
from cytosolíc proteíns. Class I MHC 
proteins on the surfaces of most cells 
display peptides that are derived from 
cytosolic proteins by proteolysis. 


immunity, evolved to cope with intracellular pathogens. T ceWs continually 
scan the surfaces of all cells and kill those that exhibit foreign markings. The 
task is not simple; intracellular microorganisms are not so obliging as to in- 
tentionally leave telltale traces on the surface of their host. Quite the con- 
trary, successful pathogens are masters of the art of camouflage. Vertebrates 
have devised an ingenious mechanism—cut and display—to reveal the 
presence of stealthy intruders. Nearly all vertebrate cells exhibit on their 
surfaces a sample of peptides derived from the digestion of proteins in their 
cytosol. These peptides are displayed by integral membrane proteins that 
are encoded by the major histocompatibility complex (MHC). Specifically, 
peptides derived from cytosolic proteins are bound to class I MHC proteins. 

How are these peptides generated and deli ver ed to the plasma mem¬ 
brane? The process starts in the cytosol with the degradation of proteins, 
self proteins as well as those of pathogens (Figure 33.22). Digestion is car- 
ried out by proteasomes (Section 23.2.2). The resulting peptide fragments 
are transported from the cytosol into the lumen of the endoplasmic reticu- 
lum by an ATP-driven pump. In the ER, peptides combine with nascent 
class I MHC proteins; these complexes are then targeted to the plasma 
membrane. 

MHC proteins embedded in the plasma membrane tenaciously grip their 
bound peptides so that they can be touched and scrutinized by T-cell re- 
ceptors on the surface of a killer cell. Foreign peptides bound to class I MHC 
proteins signal that a cell is infected and mark it for destruction by cyto- 
toxic T cells. An assembly consisting of the foreign peptide—MHC com¬ 
plex, the T-cell receptor, and numerous accessory proteins triggers a Cas¬ 
cade that induces apoptosis in the infected cell. Strictly speaking, infected 
cells are not killed but, instead, are triggered to commit suicide to aid the 
organism. 

33.5.1 Peptides Presented by MHC Proteins Occupy a Deep 
Groove Flanked by Alpha Helices 

The three-dimensional structure of a large fragment of a human MHC 
class I protein, human leukocyte antigen A2 (HLA-A2), was solved in 1987 
by Don Wiley and Pamela Bjorkman. Class I MHC proteins consist of 
a 44-kd a chain noncovalently bound to a 12-kd polypeptide called j 82 - 
microglobulin. The a chain has three extracellular domains (a^, a 2 , and 
a transmembrane segment, and a tail that extends into the cytosol (Figure 
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FIGURE 33.23 Class I MHC 


proteín. A protein of this class consists of 
two chains. The a chain begins with two 
domains that include a helices (a-i, ^ 2 ), 
an immunoglobulin domain (a^), a 
transmembrane domain, and a cytoplasmic 
tail. The second chain, p 2 -'Tiicroglobulin, 
adopts an immunoglobulin fold. 


FIGURE 33.25 Anchor residues. (A) The 

amino acid sequences of three peptides 
that bind to the class I MHC protein 
HLA-A2 are shown. Each of these peptides 
has leucine in the second position and 
valine in the carboxyl-terminal position. 

(B) Comparison of the structures of these 
peptides reveals that the amino and 
carboxyl termini as well as the side chains 
of the leucine and valine residues are in 
essentially the same position in each 
peptide, whereas the remainder of the 
structures are quite different. 



FIGURE 33.24 Class I MHC peptide-bíndíng site. The and ^2 domains come 
together to form a groove in which peptides are displayed. The two views shown reveal 
that the peptide is surrounded on three sides by a p sheet and two a helices, but it is 
accessible from the top of the structure. 


33.23). Cleavage by papain of the HL A a chain several residues before the 
transmembrane segment yielded a soluble heterodimeric fragment. The ^ 2 - 
microglobulin subunit and the domains have immunoglobulin folds, al- 
though the pairing of the two domains differs from that in antibodies. The 
ai and a 2 domains exhibit a novel and remarkable architecture. They asso- 
ciate intimately to form a deep groove that serves as the peptide-binding 
site (Figure 33.24). The floor of the groove, which is about 25 À long and 
10 À wide, is formed by eight |3 strands, four from each domain. A long he- 
lix contributed by the domain forms one side, and a helix contributed by 
the a 2 domain forms the other side. This groove is the binding site for the 
presentation of peptides. 

The groove can be filled by a peptide from 8 to 10 residues long in an 
extended conformation. As we shall see (Section 33.5.6), MHC proteins are 
remarkably diverse in the human population; each person expresses as many 
as six distinct class I MHC proteins and many different forms are present 
in different people. The first structure determined, HLA-A2, binds pep¬ 
tides that almost always have leucine in the second position and valine in 
the last position (Figure 33.25). Side chains from the MHC molecule in- 
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teract with the amino and carboxyl termini and with the side chains in these 
two key positions. These residues are often referred to as the anchor residues. 
The other residues are highly variable. Thus, many millions of different 
peptides can be presented by this particular class I MHC protein; the iden- 
tities of only two of the nine residues are crucial for binding. Each class of 
MHC molecules requires a unique set of anchor residues. Thus, a tremen- 
dous range of peptides can be presented by these molecules. Note that one 
face of the bound peptide is exposed to solution vuhere it can be examined by 
other molecules, particularly T-cell receptors. An additional remarkable fea- 
ture of MHC—peptide complexes is their kinetic stability; once bound, a 
peptide is not released, even over a period of days. 

33.5.2 T-Cell Receptors Are Antibody-like Proteins Containing 
Variable and Constant Regions 

We are now ready to consider the receptor that recognizes peptides dis- 
played by MHC proteins on target cells. The T-cell receptor consists of a 
43-kd a chain (To,) joined by a disulfide bond to a 43-kd |3 chain (Tp; Figure 
33.26). Each chain spans the plasma membrane and has a short carboxyl- 
terminal region on the cytosolic side. A small proportion of T cells express 
a receptor consisting of 7 and 8 chains in place of a and (3. T^ and Tp, like 
immunoglobulin L and H chains, consist of variable and constant regions. 
Indeed, these domains of the T-cell receptor are homologous to the V and C 
domains of immunoglobulins. Furthermore, hypervariable sequences present 
in the V regions of T^ and Tp form the binding site for the epitope. 

The genetic architecture of these proteins is similar to that of im¬ 
munoglobulins. The variable region of To, is encoded by about 50 V- 
segment genes and 70 J-segment genes. Tp is encoded by two D-segment 
genes in addition to 57 V and 13 J-segment genes. Again, the diversity of 
component genes and the use of slightly imprecise modes of joining them 
increase the number of distinct proteins formed. At least 
10^^ different specificities could arisefrom combinations of 
this repertoire of genes. Thus, T-cell receptors, like im¬ 
munoglobulins, can recognize a very large number of 
different epitopes. All the receptors on a particular T 
cell have the same specificity. 

How do T cells recognize their targets? The variable 
regions of the a and |3 chains of the T-cell receptor form 
a binding site that recognizes a combined epitope-for- 
eign peptide bound to an MHC protein (Figure 33.27). 

Neither the foreign peptide alone nor the MHC protein 
alone forms a complex with the T-cell receptor. Thus, 
fragments of an intracellular pathogen are presented in 
a context that allows them to be detected, leading to the 
initiation of an appropriate response. 

33.5.3 CD8 on Cytotoxic T Cells Acts in Concert 
with T-Cell Receptors 

The T-cell receptor does not act alone in recognizing and 
mediating the fate of target cells. Cytotoxic T cells also 
express a protein termed CD8 on their surfaces that is 
crucial for the recognition of the class I MHC-peptide 
complex. The abbreviation CD stands for cluster of differ- 
entiation, referring to a cell-surface marker that is used to 
identify a lineage or stage of differentiation. Antibodies 
specific for particular CD proteins have been invaluable 
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FIGURE 33.26 T-cell receptor. 

This protein consists of an a chain and a 
p chain, each of which consists of two 
immunoglobulin domains and a membrane- 
spanning domain. The two chains are 
linked by a disulfide bond. 
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FIGURE 33.27 T-cell receptor-Class I MHC complex. 

The T-cell receptor binds to a class I MHC protein containing a 
bound peptide. The T-cell receptor contacts both the MHC protein 
and the peptide as shown by surfaces exposed when the complex 
is separated (right). These surfaces are colored according to the 
chain that they contact. 







FIGURE 33.28 The coreceptor 
CD8. This dimeric protein extends from the 
surface of a cytotoxic T cell and binds to 
class I MHC molecules that are expressed 
on the surface of the cell that is bound to 
the T cell. The dashed lines represent 
extended polypeptide chains that link the 
immunoglobulin domain of CD8 to the 
membrane. 
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FIGURE 33.29 T-cell receptor complex. 

The T-cell receptor is associated with 
six CD3 molecules: a CD3-7-CD3-e 
heterodimer, a CD3-8-CD3-e heterodimer, 
and two chains of CD3-^. Single ITAM 
sequences are present in the cytoplasmic 
domains of CD 3 - 7 , CD3-8, and CD3-e 
whereas three such sequences are found 
in each CD3-^ chain. 


in folio wing the development of leukocytes and in disco ver ing new interac- 
tions between specific cell types. 

Each chain in the CD 8 dimer contains a domain that resembles an im¬ 
munoglobulin variable domain (Figure 33.28). CD 8 interacts primarily with 
the relatively constant domain of class I MHC proteins. This interaction 
further stabilizes the interactions between the T cell and its target. The cy- 
tosolic tail of CD 8 contains a docking site for Lck, a cytosolic tyrosine ki- 
nase akin to Src. The T-cell receptor itself is associated with six polypep- 
tides that form the CD3 complex (Figure 33.29). The 7 , 8 , and e chains of 
CD3 are homologous to Ig-a and Ig-(3 associated with the B-cell receptor 
(Section 33.4.3); each chain consists of an extracellular immunoglobulin do¬ 
main and an intracellular ITAM region. These chains associate into CD3 
ye and CD3 8 e heterodimers. An additional component, the CD3 ^ chain, 
has only a small extracellular domain and a larger intracellular domain con- 
taining three ITAM sequences. 

On the basis of these components, a model for T-cell activation can be 
envisaged that is closely parallel to the pathway for B-cell activation (Section 
33.3; Figure 33.30). The binding of the T-cell receptor with the class 
I MHC-peptide complex and the concomitant binding of CD 8 from the 
T-cell with the MHC molecule results in the association of the kinase Lck 
with the ITAM substrates of the components of the CD3 complex. Phos- 
phorylation of the tyrosine residues in the ITAM sequences generates dock¬ 
ing sites for a protein kinase called ZAP-70 (for 70-kd 2 :eta-associated 
protein) that is homologous to Syk in B cells. Docked by its two SH 2 
domains, ZAP-70 phosphorylates downstream targets in the signaling Cas¬ 
cade. Additional molecules, including a membrane-bound protein phos- 
phatase called CD45 and a cell-surface protein called CD28, play ancillary 
roles in this process. 
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FIGURE 33.30 T-cell activation. The interaction between the T-cell receptor and a class I 
MHC-peptide complex results in the binding of CD8 to the MHC protein, the recruitment of 
the protein tyrosine kinase Lck, and the phosphoylation of tyrosine residues in the ITAM 
sequences of the CD 3 chains. After phosphorylation, the ITAM regions serve as docking sites 
for the protein kinase ZAP- 70 , which phosphoylates protein targets to transmit the signal. 











T-cell activation has two important consequences. First, the activation 
of cytotoxic T cells results in the secretion of perforin. This 70-kd protein 
makes the cell membrane of the target cell permeable by polymerizing to 
form transmembrane pores 10 nm wide (Figure 33.31). The cytotoxic T cell 
then secretes proteases called granzymes into the target cell. These enzymes 
initiate the pathway of apoptosis (Section 18.6.6), leading to the death of 
the target cell and the fragmentation of its DNA, including any virai DNA 
that may be present. Second, after it has stimulated its target cell to com- 
mit suicide, the activated T cell disengages and is stimulated to reproduce. 
Thus, additional T cells that express the same T-cell receptor are generated 
to continue the battle against the invader after these T cells have been iden- 
tified as a suitable weapon. 

33.5.4 Helper T Cells Stimulate Cells That Display Foreign 
Peptides Bound to Class II MHC Proteins 

Not all T cells are cytotoxic. Helper T cells, a different class, stimulate the pro- 
liferation of specific B lymphocytes and cytotoxic T cells and thereby serve as 
partners in determining the immune responses that are produced. The impor- 
tance of helper T cells is graphically revealed by the devastation wrought 
by AIDS, a condition that destroys these cells. Helper T cells, like cytotoxic 
T cells, detect foreign peptides that are presented on cell surfaces by MHC 
proteins. However, the source of the peptides, the MHC proteins that bind 
them, and the transport pathway are different. 

Helper T cells recognize peptides bound to MHC molecules referred 
to as class II. Their helping action is focused on B cells, macrophages, and 
dendritic cells. Class II MHC proteins are expressed only by these antigen- 
presenting cells, unlike class I MHC proteins, which are expressed on nearly 
all cells. The peptides presented by class II MHC proteins do not come from 
the cytosol. Rather, they arisefrom the degradation of proteins that have been 
internalized by endocytosis. Consider, for example, a virus particle that is cap- 
tured by membrane-bound immunoglobulins on the surface of a B cell 
(Figure 33.32). This complex is delivered to an endosome, a membrane- 
enclosed acidic compartment, where it is digested. The resulting peptides 
become associated with class II MHC proteins, which move to the cell 
surface. Peptides from the cytosol cannot reach class II proteins, whereas 
peptides from endosomal compartments cannot reach class I proteins. This 
segregation of displayed peptides is biologically criticai. The association of 
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FIGURE 33.31 Consequences of 
cytotoxíc-T-cell action. An electron 
micrograph showing pores in the 
membrane of a cell that has been 
attacked by a cytotoxic T cell. The pores 
are formed by the polymerization of 
perforin, a protein secreted by the cytotoxic 
T cell. [Courtesy of Dr. Eckhard Podock.] 


Peptides bound to 
class II MHC proteins 



FIGURE 33.32 Presentation of peptides 
from internalized proteins. Antigen- 
presenting cells bind and internalize foreign 
proteins and display peptides that are 
formed from the digestion of these proteins 
in Class II MHC proteins. 










CHAPTER 33 • The Imnnune System 


a foreign peptide with a class II MHC protein signals that a cell has en- 
countered a pathogen and serves as a call for help. In contrast, association with 
a class I MHC protein signals that a cell has succumbed to a pathogen and 
is a call for destruction. 

33.5.5 Helper T Cells Rely on the T-Cell Receptor and CD4 
to Recognize Foreign Peptides on Antigen-Presenting Cells 

The overall structure of a class II MHC molecule is remarkably similar to 
that of a class I molecule. Class II molecules consist of a 33-kd a chain and 
a noncovalently bound 30-kd |3 chain (Figure 33.33). 



FIGURE 33.34 Coreceptor CD4. 

This protein comprises four tandem 
immunoglobulin domains that extend 
from the surface of helper T cells. 


Side view Top view 



a and p chains, each of which has an amino-terminal domain that constitutes half of the 
peptide-binding structure, as well as a carboxyl-terminal immunoglobulin domain. The 


peptide-binding site is similar to that in class I MHC proteins except that it is open at both 
ends, allowing class II MHC proteins to bind longer peptides than those bound by class I. 

Each contains two extracellular domains, a transmembrane segment, and a 
short cytosolic tail. The peptide-binding site is formed by the and (3 1 do¬ 
mains, each of which contributes a long helix and part of a (3 sheet. Thus, 
the same structural elements are present in class I and class II MHC mol¬ 
ecules, but they are combined into polypeptide chains in different ways. 
Class II MHC molecules appear to form stable dimers, unlike class I mol¬ 
ecules, which are monomeric. The peptide-binding site of a class II mole¬ 
cule is open at both ends, and so this groove can accommodate longer pep¬ 
tides than can be bound by class I molecules; typically, peptides between 
13 and 18 residues long are bound. The peptide-binding specificity of each 
class II molecule depends on binding pockets that recognize particular 
amino acids in specific positions along the sequence. 

Helper T cells express T-cell receptors that are produced from the same 
genes as those on cytotoxic T cells. These T-cell receptors interact with class 
II MHC molecules in a manner that is analogous to T-cell-receptor inter- 
action with class I MHC molecules. Nonetheless, helper T cells and cyto¬ 
toxic T cells are distinguished by other proteins that they express on their 
surfaces. In particular, helper T cells express a protein called CD4 instead 
of expressing CD8. CD4 consists of four immunoglobulin domains that ex¬ 
tend from the T-cell surface, as well as a small cytoplasmic region (Figure 
33.34). The amino-terminal immunoglobulin domains of CD4 interact with 
the base on the class II MHC molecule. Thus, helper T cells bind cells 
expressing class II MHC specifically because of the interactions with CD4 
(Figure 33.35). 







(A) Cell presenting cytoplasmically 
derived foreign peptide in class I MHC 



(B) 


Antigen-presenting cell 




FIGURE 33.35 Varíatíons on a theme. (A) Cytotoxic T cells recognize foreign peptides 
presented in class I MHC proteins with the aid of the coreceptor CD8. (B) Helper T cells 
recognize peptides presented in class II MHC proteins by specialized antigen-presenting 
cells with the aid of the coreceptor CD4. 


When a helper T cell binds to an antigen-presenting cell expressing an 
appropriate class II MHC-peptide complex, signaling pathways analogous 
to those in cytotoxic T cells are initiated by the action of the kinase Lck on 
ITAMs in the CD3 molecules associated with the T-cell receptor. How- 
ever, rather than triggering events leading to the death of 
the attached cell, these signaling pathways result in the se- 
cretion of cytokines from the helper cell. Cytokines are a 
family of molecules that include, among others, inter- 
leukin-2 and interferon-y. Cytokines bind to specific re¬ 
ceptor s on the antigen-presenting cell and stimulate 
growth, differentiation, and in regard to plasma cells, 
which are derived from B cells, antibody secretion 
(Figure 33.36). Thus, the internalization and presenta- 
tion of parts of a foreign pathogen help to generate a lo¬ 
cal environment in which cells taking part in the defense 
against this pathogen can flourish through the action of 
helper T cells. 

33.5.6 MHC Proteins Are Highly Diverse 

MHC class I and II proteins, the presenters of 
peptides to T cells, were discovered because of 
their role in transplantation rejection. A tissue trans- 
planted from one person to another or from one mouse to 
another is usually rejected by the immune system. In con- 
trast, tissues transplanted from one identical twin to an¬ 
other or between mice of an inbred strain are accepted. 

Genetic analyses revealed that rejection occurs when tis¬ 
sues are transplanted between individuais having differ- 
ent genes in the major histocompatibility complex, a clus- 
ter of more than 7 5 genes playing key roles in immunity. 

The 3500-kb span of the MHC is nearly the length of the 
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FIGURE 33.36 Helper T cell action. The engagement of the 
T-cell receptor in helper T cells resuits in the secretion of cytokines. 
These cytokines bind to cytokine receptors expressed on the 
surface of the antigen-presenting cell, stimulating cell growth, 
differentiation, and, in regard to a B cell, antibody secretion. 
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FIGURE 33.37 Polymorphísm ín 
class I MHC proteíns. The positions of 
sites with a high degree of polymorphísm 
in the human population are displayed as 
red spheres on the structure of the amino- 
terminal part of a class I MHC protein. 



FIGURE 33.38 Human ímmunodefidency 
vírus. A schematic diagram of HIV reveals 
its proteins and nucleic acid components. 
The membrane-envelope glycoproteins 
gp41 and gpl20 are shown in dark and 
light green. The virai RNA is shown in red, 
and molecules of reverse transcriptase are 
shown in blue. [After R. C. Gallo. The AIDS virus. 
Copyright © 1987 by Scientific American, Inc. All 
rights reserved.] 


entire E. coli chromosome. The MHC encodes class I proteins (presen- 
ters to cytotoxic T cells) and class II proteins (presenters to helper T 
cells), as well as class III proteins (components of the complement Cas¬ 
cade) and many other proteins that play key roles in immunity. 

Human beings express six different class I genes (three from each par- 
ent) and six different class II genes. The three loci for class I genes are called 
HLA-A, -B, and -C; those for class II genes are called HLA-DP, -DQ, and 
-DR. These loci are highly polymorphic: many alleles of each are present in 
the population. For example, more than 50 each of HLA-A, -B, and -C al¬ 
leles are known; the numbers discovered increase each year. Hence, the like- 
lihood that two unrelated per sons have identical class I and II proteins is 
very small (<10”"^), accounting for transplantation rejection unless the 
genotypes of donor and acceptor are closely matched in advance. 

Differences between class I proteins are located mainly in the ai and a 2 
domains, which form the peptide-binding site (Figure 33.37). The do- 
main, which interacts with a constant |32-microglobulin is largely conserved. 
Similarly, the differences between class II proteins cluster near the peptide- 
binding groove. Why are MHC proteins so highly variable? Their diversity 
makes possible the presentation of a very wide range of peptides to T cells. A 
particular class I or class II molecule may not be able to bind any of the pep- 
tide fragments of a virai protein. The likelihood of a fit is markedly increased 
by having several kinds (usually six) of each class of presenters in each 
individual. If all members of a species had identical class I or class II mol¬ 
ecules, the population would be much more vulnerable to devastation by a 
pathogen that had evolved to evade presentation. The evolution of the di- 
verse human MHC repertoire has been driven by the selection for individ¬ 
ual members of the species who resist infections to which other members 
of the population may be susceptible. 

33.5.7 Human ímmunodefidency Viruses Subvert the Immune 
System by Destroying Helper T Cells 

In 1981, the first cases of a new disease now called acquired immune defi- 
ciency syndrome (AIDS) were recognized. The victims died of rare infections 
because their immune systems were crippled. The cause was identified two 
years later by Luc Montagnier and coworkers. AIDS is produced by human 
Ímmunodefidency virus (HIV), of which two major classes are known: HIV- 
1 and the much less common HIV-2. Like other retroviruses, HIV contains 
a single-stranded RNA genome that is replicated through a double-stranded 
DNA intermediate. This virai DNA becomes integrated into the genome 
of the host cell. In fact, virai genes are transcribed only when they are inte¬ 
grated into the host DNA. 

The HIV virion is enveloped by a lipid bilayer membrane containing 
two glycoproteins: gp41 spans the membrane and is associated with gpl20, 
which is located on the externai face (Figure 33.38). The core of the virus 
contains two copies of the RNA genome and associated transfer RNAs, and 
several molecules of reverse transcriptase. They are surrounded by many 
copies of two proteins called pl8 and p24. The host cell for HIV is the helper 
T cell. The gpl20 molecules on the membrane of HIV bind to CD4 
molecules on the surface of the helper T cell (Figure 33.39). This interac- 
tion allows the associated virai gp41 to insert its amino-terminal head into 
the host-cell membrane. The virai membrane and the helper-cell membrane 
fuse, and the virai core is released directly into the cytosol. Infection by 
HIV leads to the destruction of helper T cells because the permeability of 
the host plasma membrane is markedly increased by the insertion of virai 
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FIGURE 33.39 HIV receptor. A complex between a modified form of the envelope 
glycoprotein gpl20 from HIV and a peptide corresponding to the two amino-terminal domains 
from the helper T-cell protein CD4 reveals how virai infection of helper T cells is initiated. 


Suppression of Auto-lmmune Response 


glycoproteins and the budding of virus particles. The influx of ions and wa- 
ter disrupts the ionic balance, causing osmotic lysis. 

T The development of an effective AIDS vaccine is difficult owing 
to the antigenic diversity of HIV strains. Because its mechanism for 
replication is quite error prone, a population of HIV presents an ever- 
changing array of coat proteins. Indeed, the mutation rate of HIV is more 
than 65 times as high as that of influenza virus. A major aim now is to de¬ 
fine relatively conserved sequences in these HIV proteins and use them as 
immunogens. 

33.6 IMIVIUNE RESPONSES AGAINST SELF-ANTIGENS 
ARE SUPPRESSED 

The primary function of the immune system is to protect the host from in- 
vasion by foreign organisms. But how does the immune system avoid 
mounting attacks against the host organism? In other words, how does the 
immune system distinguish between self and nonself ? Clearly, proteins from 
the organism itself do not bear some special tag identifying them. Instead, 
selection processes early in the developmental pathways for immune cells 
kill or suppress those immune cells that react strongly with self-antigens. 
The evolutionary paradigm still applies; immune cells that recognize self- 
antigens are generated, but selective mechanisms eliminate such cells in the 
course of development. 

33.6.1 T Cells Are Subject to Positive and Negative Selection 
in the Thymus 

T cells derive their name from the location of their production —the 
thymus, a small organ situated just abo ve the heart. Examination of the de¬ 
velopmental pathways leading to the production of mature cytotoxic and 
helper T cells reveals the selection mechanisms that are crucial for distin- 
guishing self from nonself. These selection criteria are quite stringent; ap- 
proximately 98% of the thymocytes, the precursors of T cells, die before the 
completion of the maturation process. 

Thymocytes produced in the bone marrow do not express the T-cell- 
receptor complex, CD4, or CD8. On relocation to the thymus and re- 
arrangement of the T-cell-receptor genes, the immature thymocyte expresses 
all of these molecules. These cells are first subjected to positive selection 
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FIGURE 33.40 T-cell selection. A population of thymocytes is subjected first to positive 
selection to remove cells that express T-cell receptors that wili not bind to MHC proteins 
expressed by the individual organism. The surviving cells are then subjected to negative 
selection to remove cells that bind strongly to MHC complexes bound to self-peptides. 




33.6.2 Autoimmune Diseases Result 
from the Generation of Immune Responses 
Against Self-Antigens 


(Figure 33.40). Cells for which the T-cell receptor can bind with reasonable 
affinity to either class I or class II MHC molecules survive this selection; 
those for which the T-cell receptor does not participate in such an interac- 
tion undergo apoptosis and die. The affinities of interaction required to pass 
this selection are relatively modest, and so contacts be- 
tween the T-cell receptor and the MHC molecules them- 
selves are sufficient without any significant contribution 
from the bound peptides (which will be derived from pro¬ 
teins in the thymus). The role of the positive selection step 
is to prevent the production of T cells that will not bind to 
any MHC complex present, regardless of thepeptide bound. 

The cell population that survives positive selection is 
subjected to a second step, negative selection. Here, T cells 
that bind with high affinity to MHC complexes bound 
to self-peptides expressed on the surfaces of antigen-pre- 
senting cells in the thymus undergo apoptosis or are oth- 
erwise suppressed. Those that do not bind too avidly to 
any such MHC complex complete development and be- 
come mature cytotoxic T cells (which express only CD8) 
or helper T cells (which express only CD4). The nega¬ 
tive selection step leads to self tolerance; cells that bind an 
MHC-self-peptide complex are removed from the T-cell 
population. Similar mechanisms apply to developing 
B cells, suppressing B cells that express antibodies that 
interact strongly with self-antigens. 


FIGURE 33.41 Consequentes of autoímmuníty. Photo- 
micrographs of an islet of Langerhans (A) in the pancreas of a 
normal mouse and (B) in the pancreas of a mouse with an 
immune response against pancreatic p cells, which resuits in a 
disease resembling insulin-dependent diabetes meilitus in human 
beings. [From M. A. Atkinson and N. K. Maclaren. What causes diabetes? 
Copyright © 1990 by Scientific American, Inc. All rights reserved.] 


T Although thymic selection is remarkably efficient in 
suppressing the immune response to self-antigens, 
failures do occur. Such failures resuits in autoimmune dis¬ 
eases. These diseases include relatively common illnesses 
such as insulin-dependent diabetes meilitus, multiple 
sclerosis, and rheumatoid arthritis. In these illnesses, im¬ 
mune responses against self-antigens result in damage to 
selective tissues that express the antigen (Figure 33.41). 




















In many cases, the cause of the generation of self-reactive antibodies or 
T cells in unclear. However, in other cases, infectious organisms such as 
bactéria or viruses may play a role. Infection leads to the generation of an¬ 
tibodies and T cells that react with many different epitopes from the infec¬ 
tious organism. If one of these antigens closely resembles a self-antigen, an 
autoimmune response can result. For example, Streptococcus infections 
sometimes lead to rheumatic fever owing to the production of antibodies 
to streptococcal antigens that cross-react with exposed molecules in heart 
muscle. 

33.6.3 The Immune System Plays a Role in Câncer Prevention 

T The development of immune responses against proteins encoded 
by our own genomes can be beneficiai under some circumstances. 
Câncer cells have undergone significant changes that often result in the ex- 
pression of proteins that are not normally expressed. For example, the mu- 
tation of genes can generate proteins that do not correspond in amino acid 
sequence to any normal protein. Such proteins may be recognized as for- 
eign, and an immune response will be generated specifically against the cân¬ 
cer cell. Alternatively, câncer cells often produce proteins that are expressed 
during embryonic development but are not expressed or are expressed at 
very low leveis after birth. For example, a membrane glycoprotein protein 
called carcinoembryonic antigen (CEA) appears in the gasterointestinal cells 
of developing fetuses but is not normally expressed at significant leveis af¬ 
ter birth. More than 50% of patients with colorectal câncer have elevated 
serum leveis of CEA. Immune cells recognizing epitopes from such pro¬ 
teins will not be subject to negative selection and, hence, will be present in 
the adult immune repertoire. These cells may play a câncer surveillance role, 
killing cells that overexpress antigens such as CEA and preventing geneti- 
cally damaged cells from developing into tumors. 


SUMMARY 


O To respond effectively to a vast array of pathogens, the immune system 
must be tremendously adaptable. Adaptation by the immune system 
follows the principies of evolution: an enormously diverse set of poten- 
tially useful proteins is generated; these proteins are then subjected to 
intense selection so that only cells that express useful proteins flourish 
and continue development, until an effective immune response to a spe- 
cific invader is generated. 

o Antibodies Possess Distinct Antigen-Binding and Effector Units 

The major immunoglobulin in the serum is immunoglobulin G. An IgG 
protein is a heterotetramer with two heavy chains and two light chains. 
Treatment of IgG molecules with proteases such as papain produces 
three fragments: two F^b fragments that retain antigen-binding activity 
and an F^ fragment that retains the ability to activate effector functions 
such as the initiation of the complement cascade. The Fab fragments in- 
clude the L chain and the amino-terminal half of the H chain; the Fc 
domain is a dimer consisting of the carboxyyl-terminal halves of two H 
chains. Five different classes of antibody—IgG, IgM, IgA, IgD, and 
IgE—differ in their heavy chains and, hence, in their effector functions. 

o The Immunoglobulin Fold Consists of a Beta-Sandwich Framework 
with Hypervariable Loops 

One particular protein fold is found in many of the key proteins of the 
immune system. The immunoglobulin fold consists of a pair of |3 sheets 






that pack against one another, linked by a single disulfide bond. Loops 
projecting from one end of the structure form a binding surface that can 
be varied by changing the amino acid sequences within the loops. 
Domains with immunoglobulin folds are linked to form antibodies 
and other classes of proteins in the immune system including T-cell 
receptors. 

• Antibodies Bind Specific Molecules Through Their Hypervariable Loops 

Two chains come together to form the binding surface of an antibody. 
Three loops from each domain, the complementarity-determining re- 
gions, form an essentially continuous surface that can vary tremen- 
dously in shape, charge, and other characteristics to allow particular an¬ 
tibodies to bind to molecules ranging from small molecules to large 
protein surfaces. 

• Diversity Is Generated by Gene Rearrangements 

The tremendous diversity of the amino acid sequences of antibodies is 
generated by segmentai rearrangements of genes. For antibody k light 
chains, one of 40 variable regions is linked to one of five joining regions. 
The combined VJ unit is then linked to the constant region. Thousands 
of different genes can be generated in this manner. Similar arrays are 
rearranged to form the genes for the heavy chains, but an additional re¬ 
gion called the diversity region lies between the V and the J regions. The 
combination L and H chains, each obtained through such rearranged 
genes, can produce more than 10^ distinct antibodies. Different classes 
of antibodies are also generated by gene rearrangements that lead to class 
switching. Oligomerization of membrane-bound antibody molecules 
initiates a signal-transduction cascade inside B cells. Key steps in this 
signaling process include the phosphorylation of specific tyrosine 
residues in sequences termed immunoreceptor tyrosine-based activa- 
tion motifs (ITAMs), present in proteins that associate with the 
membrane-bound antibodies. 

• Major-Histocompatibility-Complex Proteins Present Peptide Antigens 
on Cell Surfaces for Recognition by T-Cell Receptors 

Intracellular pathogens such as viruses and mycobacteria cannot be eas- 
ily detected. Intracellular proteins are constantly being cut into small 
peptides by proteasomes and displayed in class I major-histocompati- 
bility-complex proteins on cell surfaces. Such peptides lie in a groove 
defined by two helices in the class I MHC proteins. The combination 
of MHC protein and peptide can be bound by an appropriate T-cell re¬ 
ceptor. T-cell receptors resemble the antigen-binding domains of anti¬ 
bodies in structure, and diversity in T-cell-receptor sequence is gener¬ 
ated by V(D)J gene rearrangements. The T-cell receptor recognizes 
features of both the peptide and the MHC molecule that presents it. 
Cytotoxic T cells initiate apoptosis in cells to which they bind through 
T-cell receptor-class I MHC-peptide interactions aided by interactions 
with the coreceptor molecule CD 8. Helper T cells recognize peptides 
presented in class II MHC proteins, a distinct type of MHC protein 
expressed only on antigen-presenting cells such as B cells and 
macrophages. Helper T cells express the coreceptor CD4, rather than 
CD8. CD4 interacts with class II MHC proteins present on antigen- 
presenting cells. Signaling pathways, analogous to those in B cells, are 
initiated by interactions between MHC-peptide complexes and T-cell 
receptors and the CD8 and CD4 coreceptors. Human immunodefi- 
ciency virus damages the immune system by infecting cells that express 
CD4, such as helper T cells. 



• Immune Responses Against Self-Antigens Are Suppressed -1 947 |- 

In principie, the immune system is capable of generating antibodies and Selected Readings 

T-cell receptors that bind to self-molecules; that is, molecules that are 

normally present in a healthy and uninfected individual organism. Se- 

lection mechanisms prevent such self-directed molecules from being ex- 

pressed at high leveis. The selection process includes both positive se- 

lection, to enrich the population of cells that express molecules that have 

the potential to bind foreign antigens in an appropriate context, and 

negative selection, which eliminates cells that express molecules with 

too high an affinity for self-antigens. Autoimmune diseases such as 

insulin-dependent diabetes mellitus can result from amplification of a 

response against a self-antigen. 
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-1 PROBLEMS I- 

1. Energetics and kinetics. Suppose that the dissociation con- 
stant of an Fab-hapten complex is 3X10 ^Mat25°C. 

(a) What is the standard free energy of binding? 

(b) Immunologists often speak of affinity (Ka)> the reciprocai of 
the dissociation constant, in comparing antibodies. What is the 
affinity of this Fab? 

(c) The rate constant of release of hapten from the complex is 
120 s ^. What is the rate constant for association? What does 
the magnitude of this value imply about the extent of structural 
change in the antibody on binding hapten? 

2. Sugar niche. An antibody specific for dextran, a polysac- 
charide of glucose residues, was tested for its binding of glucose 
oligomers. Maximal binding affinity was obtained when the 
oligomer contained six glucose residues. How does the size of 
this site compare with that expected for the binding site on the 
surface of an antibody? 

3. A brilliant emitter. Certain naphthalene derivatives exhibit 
a weak yellow fluorescence when they are in a highly polar en- 
vironment (such as water) and an intense blue fluorescence when 
they are in a markedly nonpolar environment (such as hexane). 
The binding of e-dansyl-lysine to specific antibody is accompa- 
nied by a marked increase in its fluorescence intensity and a shift 
in color from yellow to blue. What does this finding reveal about 
the hapten-antibody complex? 

4. Avidity versus affinity. The standard free energy of binding 
of Fab derived from an antiviral IgG is —7 kcal moP^ ( — 29 kj 
moP^) at 25°C. 

(a) Calculate the dissociation constant of this interaction. 

(b) Predict the dissociation constant of the intact IgG, assum- 
ing that both combining sites of the antibody can interact with 
virai epitopes and that the free-energy cost of assuming a favor- 
able hinge angle is +3 kcal moP^ (12.6 kJ moP^). 

5. Miniantibody. The Fab fragment of an antibody molecule has 
essentially the same affinity for a monovalent hapten as does in¬ 
tact IgG. 

(a) What is the smallest unit of an antibody that can retain the 
specificity and binding affinity of the whole protein? 

(b) Design a compact single-chain protein that is likely to 
specifically bind antigen with high affinity. 

6. Turning on B cells. B lymphocytes, the precursors of plasma 
cells, are triggered to proliferate by the binding of multivalent 
antigens to receptors on their surfaces. The cell-surface recep- 
tors are transmembrane immunoglobulins. Univalent antigens, 
in contrast, do not activate B cells. 

(a) What do these findings reveal about the mechanism of B- 
cell activation? 

(b) How might antibodies be used to activate B cells? 

7. An ingenious cloning strategy. In the cloning of the gene for 
the a chain of the T-cell receptor, T-cell cDNAs were hybridized 
with B-cell mRNAs. What was the purpose of this hybridiza- 
tion step? Can the principie be applied generally? 


8. Instruction. Before the mechanism for generating antibody 
diversity had been established, a mechanism based on protein 
folding around an antigen was proposed, primarily by Linus 
Pauling. In this model, antibodies that had different specifici- 
ties had the same amino acid sequence but were folded in dif¬ 
ferent ways. Propose a test of this model. 

9. Dealing with nonsense. Cells, including immune cells, de¬ 
grade mRNA molecules in which no long open reading frame is 
present. The process is called nonsense-mediated RNA decay. 
Suggest a role for this process in immune cells. 

10. Crystallization. The proteolytic digestion of a population of 
IgG molecules isolated from human serum results in the genera- 
tion of Fab and Fc fragments. Why do Fc fragments crystallize 
more easily than Fab fragments generated from such a population? 

11. Presentation. The amino acid sequence of a small protein is: 

MSRLASKNLIRSDHAGGLLQATYSAVSS- 

IKNTMSFGAWSNAALNDSRDA 

Predict the most likely peptide to be presented by the class I 
MHC molecule HLA-A2. 

Mechanism Problem 

12. Catalytic antibody. Antibody is generated against a transi- 
tion State for the hydrolysis of the following ester. 




Some of these antibodies catalyze the hydrolysis of the ester. 
What amino acid residue might you expect to find in the bind¬ 
ing site on the antibody? 

Chapter Integration Problem 

13. Signaling. Protein tyrosine phosphatases, such as the mole¬ 
cule CD45 expressed in both B cells and T cells, play important 
roles in activating such protein tyrosine kinases as Fyn and Lck, 
which are quite similar to Src. Suggest a mechanism for the ac¬ 
tivation of such protein kinases by the removal of a phosphate 
from a phosphotyrosine residue. 

Data Interpretation Problem 

14. Affinity maturation. A mouse is immunized with an 
oligomeric human protein. Shortly after immunization, a cell line 
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that expresses a single type of antibody molecule (antibody A) 
is derived. The ability of antibody A to bind the human protein 
is assayed with the results shown in the adjoining graph. After 
repeated immunizations with the same protein, another cell line 
is derived that expresses a different antibody (antibody B). The 
results of analyzing the binding of antibody B to the protein also 
are shown. From these data, estimate 

(a) the dissociation constant (fCj) for the complex between the 
protein and antibody A. 

(b) the dissociation constant for the complex between the pro¬ 
tein and antibody B. 

Comparison of the amino acid sequences of antibody A and an¬ 
tibody B reveals them to be identical except for a single amino 
acid. What does this finding suggest about the mechanism by 
which the gene encoding antibody B was generated? 
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33.2 THE IMMUNOGLOBULIN FOLD CONSISTS 
OF A BETA-SANDWICH FRAMEWORK 
WITH HYPERVARIABLE LOOPS 


FIGURE 33.9 Immunoglobulin 
fold. An immunoglobulin domain consists 
of a pair of p-sheets linked by a disulfide 
bond and hydrophobic interactions. Three 
hypen/ariable loops lie at one end of the 
structure. 


An IgG molecule consists of a total of 12 immunoglobulin domains. These 
domains have many sequence features in common and adopt a common 
structure, the immunoglobulin fold (Figure 33.9). Remarkably, this same 
structural domain is found in many other proteins that play key roles in the 
immune system. 



The immunoglobulin fold consists of a pair of |3 sheets, each built of 
antiparallel (3 strands, that surround a central hydrophobic core. A single 
disulfide bond bridges the two sheets. Two aspects of this structure are par- 
ticularly important for its function. First, three loops present at one end of 
the structure form a potential binding surface. These loops contain the 
hypervariable sequences present in antibodies and in T-cell receptors (see 
Sections 33.3 and 33.5.2). Variation of the amino acid sequences of these 
loops provides the major mechanism for the generation of the vastly diverse 
set of antibodies and T-cell receptors expressed by the immune system. 
These loops are referred to as hypervariable loops or complementarity- 
determining regions (CDRs). Second, the amino terminus and the carboxyl 
terminus are at opposite ends of the structure, which allows structural do¬ 
mains to be strung together to form chains, as in the L and H chains of 
antibodies. Such chains are present in several other key molecules in the 
immune system. 

Y The immunoglobulin fold is one of the most prevalent domains en- 
coded by the human genome— more than 750 genes encode proteins 
with at least one immunoglobulin fold recognizable at the levei of amino 
acid sequence. Such domains are also common in other multicellular ani¬ 
mais such as flies and nemotodes. However, from inspection of amino acid 
sequence alone, immunoglobulin-fold domains do not appear to be present 
in yeast or plants. However, structurally similar domains are present in these 
organisms, including the key photosynthetic electron-transport protein 
plastocyanin in plants (Section 19.3.2). Thus, the immunglobulin-fold 
family appears to have expanded greatly along evolutionary branches lead- 
ing to animais—particularly, vertebrates. 
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33.3 ANTIBODIES BIND SPECIFIC MOLECULES 
THROUGH THEIR HYPERVARIABLE LOOPS 

For each class of antibody, the amino-terminal immunoglobin domains of 
the L and H chains (the variable domains, designated Vl and Vh) come to- 
gether at the ends of the arms extending from the structure. The positions 
of the complementarity-determining regions are striking. These hypervari- 
able sequences, present in three loops of each domain, come together so that 
all six loops form a single surface at the end of each arm (Figure 33.10). 
Because virtually any Vl can pair with any Vh, ü. very large number of dif- 
ferent binding sites can be constructed by their combinatorial association. 


- ^927 I - 

Antibody Binding 



Side view End-on view 




FIGURE 33.10 Variable domains. 

Two views of the variable domains of the 
L Chain (yellow) and the H chain (blue); 
the complementarity-determining regions 
(CDRs) are shown in red. The six CDRs 
come together to form a binding surface. 
The specificity of the surface is determined 
by the sequences and structures of the 
CDRs. 


33.3.1 X-Ray Analyses Have Revealed How Antibodies 
Bind Antigens 

The results of x-ray crystallographic studies of many large and small anti¬ 
gens bound to Fab molecules have been sources of much insight into the 
structural basis of antibody specificity. The binding of antigens to antibod¬ 
ies is governed by the same principies that govern the binding of substrates 
to enzymes. The apposition of complementary shapes results in numerous 
contacts between amino acids at the binding surfaces of both molecules. 
Numerous hydrogen bonds, electrostatic interactions, and van der Waals 
interactions, reinforced by hydrophobic interactions, combine to give spe- 
cific and strong binding. 

A few aspects of antibody binding merit specific attention, inasmuch as 
they relate directly to the structure of immunoglobulins. The binding site 
on the antibody has been found to incorporate some or all of the CDRs in 
the variable domains of the antibody. Small molecules (e.g., octapeptides) 
are likely to make contact with fewer CDRs, with perhaps 15 residues of 
the antibody participating in the binding interaction. Macromolecules of- 
ten make more extensive contact, interacting with all six CDRs and 20 or 
more residues of the antibody. Small molecules often bind in a cleft of the 
antigen-binding region. Macromolecules such as globular proteins tend to 
interact across larger, fairly flat apposed surfaces bearing complementary 
protrusions and depressions. 

A well-studied case of small-molecule binding is seen in an example 
of phosphorylcholine bound to Fab- Crystallographic analysis revealed 
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FIGURE 33.11 Bindíngofa 
small antígen. The structure of a complex 
between an Fab fragment of an antibody 
and its target-in this case, phosphoryl- 
choline. Residues from the antibody 
interact with phosphorylcholine through 
hydrogen bonding and electrostatic and 
van der Waals interactions. 




FIGURE 33.12 Antibodíes agaínst lysozyme. (A) The structures of 
three complexes (i, ii, iii) between Fab fragments (blue and yellow) and hen 
egg-white lysozyme (red) shown with lysozyme in the same orientation in 
each case. The three antibodies recognize completely different epitopes on 
the lysozyme molecule. (B) The Fab fragments from part A with points of 
contact highlighted as space-filling models, revealing the different 
shapes of the antigen-binding sites. 




(A) 
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phosphorylcholine bound to a cavity lined by residues from five CD Rs— I 929 | 

two from the L chain and three from the H chain (Figure 33.11). The pos- Generation of Antibody Diversity 

itively charged trimethylammonium group of phosphorylcholine is buried 

inside the wedge-shaped cavity, where it interacts electrostatically with two 

negatively charged glutamate residues. The negatively charged phosphate 

group of phosphorylcholine binds to the positively charged guanidinium 

group of an arginine residue at the mouth of the crevice and to a nearby ly- 

sine residue. The phosphate group is also hydrogen bonded to the hydroxyl 

group of a tyrosine residue and to the guanidinium group of the arginine 

side chain. Numerous van der Waals interactions, such as those made by a 

tryptophan side chain, also stabilize this complex. 

The binding of phosphorylcholine does not significantly change the 
structure of the antibody, yet induced fit plays a role in the formation of 
many antibody-antigen complexes. A malleable binding site can accom- 
modate many more kinds of ligands than can a rigid one. Thus, induced fit 
increases the repertoire of antibody specificities. 


33.3.2 Large Antigens Bind Antibodies with Numerous 
Interactions 

How do large antigens interact with antibodies? A large collection of anti¬ 
bodies raised against hen egg-white lysozyme has been structurally charac- 
terized in great detail (Figure 33.12). Each different antibody binds to a dis- 
tinct surface of lysozyme. Let us examine the interactions present in one of 
these complexes in detail. This antibody binds two polypeptide segments 
that are widely separated in the primary structure, residues 18 to 27 and 116 
to 129 (Figure 33.13). 



FIGURE 33.13 Antibody-proteín 
interactions. The structure of a complex 
between an Fab fragment and lysozyme 
reveals that the binding surfaces are 
complementary in shape over a large area. 
A single residue of lysozyme, glutamine 
121, penetrates more deeply into the 
antibody combining site. 


All six CDRs of the antibody make contact with this epitope. The region 
of contact is quite extensive (about 30 X 20 À). The apposed surfaces are 
rather flat. The only exception is the side chain of glutamine 121 of 
lysozyme, which penetrates deeply into the antibody binding site, where it 
forms a hydrogen bond with a main-chain carbonyl oxygen atom and is sur- 
rounded by three aromatic side chains. The formation of 12 hydrogen bonds 
and numerous van der Waals interactions contributes to the high affinity 
(fCj — 20 nM) of this antibody-antigen interaction. Examination of the F^b 
molecule without bound protein reveals that the structures of the Vl and 
Vh domains change little on binding, although they slide 1 A apart to al- 
low more intimate contact with lysozyme. 


33.4 DIVERSITY IS GENERATED BY GENE 
REARRANGEMENTS 

A mammal such as a mouse or a human being can synthesize large amounts 
of specific antibody against virtually any foreign determinant within a mat- 
ter of days of being exposed to it. We have seen that antibody specificity is 
determined by the amino acid sequences of the variable regions of both light 
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I 950 I and heavy chains, which brings us to the key question: How are different 

CHAPTER 33 • The Inrimune System variable-region sequences generated? 

The discovery of distinct variable and constant regions in the L and H 
chains raised the possibility that the genes that encode immunoglobulins 
have an unusual architecture that facilitates the generation of a diverse set 
of polypeptide products. In 1965, William Dreyer and Claude Bennett pro- 
posed that multiple V (variable) genes are separate from a single C (constant) 
gene in embryonic (germ-line) DNA. According to their model, one of these 
V genes becomes joined to the C gene in the course of differentiation of the 
antibody-producing cell. A criticai test of this novel hypothesis had to await 
the isolation of pure immunoglobulin mRNA and the development of tech- 
niques for analyzing mammalian genomes. Twenty years later, Susumu 
Tonegawa found that V and C genes are indeed far apart in embryonic DNA 
but are closely associated in the DNA of antibody-producing cells. Thus, 
immunoglobulin genes are rearranged in the differentiation of lymphocytes. 

33.4.1 J (Joining) Genes and D (Diversity) Genes Increase 
Antibody Diversity 

Sequencing studies carried out by Susumu Tonegawa, Philip Leder, and 
Leroy Hood revealed that V genes in embryonic cells do not encode the en- 
tire variable region of L and H chains. Consider, for example, the region 
that encodes the k light-chain family. A tandem array of 40 segments, each 
of which encodes approximately the first 97 residues of the variable domain 
of the L chain, is present on human chromosome 2 (Figure 33.14). 

11 rr 11 JihbMs 


FIGURE 33.14 The k líght-chaín locus. This part of human chromosome 2 includes an 
array of 40 segments that encode the variable (V) region (approximately residues 1 -97) 
of the light chain, an array of 5 segments that encode the joining (J) region (residues 
98-110), and a single region that encodes the constant (C) region. 


However, the variable region of the L chain extends to residue 110. Where 
is the DNA that encodes the last 13 residues of the V region? For L chains 
in undifferentiated cells, this stretch of DNA is located in an unexpected 
place: near the C gene. It is called the J gene because it joins the V and C 
genes in a differentiated cell. In fact, a tandem array of five J genes is lo¬ 
cated near the C gene in embryonic cells. In the differentiation of an anti¬ 
body-producing cell, a V gene becomes spliced to a J gene to form a com¬ 
plete gene for the variable region (Figure 33.15). RNA splicing generates 



Vi V, 





FIGURE 33.15 VJ recombinatíon. A single V gene (in this case, V2) is linked to a J gene 
(here, J4) to form an intact VJ region. The inten/ening DNA is released in a circular form. 
Because the V and J regions are selected at random and the joint between them is not 
aiways in exactiy the same place, many VJ combinations can be generated by this process. 
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an mRNA molecule for the complete L chain by linking the coding regions 
for the rearranged VJ unit with that for the C unit (Figure 33.16). 

J genes are important contributors to antibody diversity because they 
encode part of the last hypervariable segment (CDR3). In forming a con- 
tinuous variable-region gene, any of the 40 V genes can become linked to 
any of five J genes. Thus, somatic recombination of these gene segments 
amplifies the diversity already present in the germ line. The linkage between 
V and J is not precisely controlled. Recombination between these genes can 
take place at one of several bases near the codon for residue 95, generating 
additional diversity. A similar array of V and J genes encoding the A light 
chain is present on human chromosome 22. This region includes 30 gene 
segments and four J;^ segments. In addition, this region includes four dis- 
tinct C genes, in contrast with the single C gene in the k locus. 

In human beings, the genes encoding the heavy chain are present on chro¬ 
mosome 14. Remarkably, the variable domain of heavy chains is assembled 
from three rather than two segments. In addition to Vh genes that encode 
residues 1 to 94 and Jh segments that encode residues 98 to 113, this chro- 
mosomal region includes a distinct set of segments that encode residues 95 
to 97 (Figure 33.17). These gene segments are called D for diversity. Some 
27 D segments lie between 51 Vh and 6 Jh segments. The recombination 
process first joins a D segment to a Jh segment; a Vh segment is then joined 
to DJh- a greater variety of antigen-binding patches and clefts can be formed 
by the H chain than by the L chain because the H chain is encoded by three 
rather than two gene segments. Moreover, CDR3 of the H chain is diversi- 
fied by the action of terminal deoxyribonucleotidyl transferase, a special 
DNA polymerase that requires no template. This enzyme inserts extra nu- 
cleotides between Vh and D. The V{D)J recombination of both the L and the 
H chains is executed by specific enzymes present in immune cells. These 
proteins, called RAG-1 and RAG-2, recognize specific DNA sequences 
called recombination signal sequences (RSSs) adjacent to the V, D, and J seg¬ 
ments and facilitate the cleavage and religation of the DNA segments. 


Rearranged gene 

V 1 V 2 J 4 J 5 C 



Transcription 

N/ 

pre-mRNA 

VV ' 

Splicing 

N/ 

mRNA 


Translation 
and Processing 

L-chain protein 

FIGURE 33.16 Light-chaín expression. 

The light-chain protein is expressed by 
transcription of the rearranged gene to 
produce a pre-RNA molecule with the VJ 
and C regions separated. RNA splicing 
removes the intervening sequences to 
produce an mRNA molecule with the VJ 
and C regions linked. Translation of the 
mRNA and processing of the initial protein 
product produces the light chain. 
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FIGURE 33.17 V(D)J recombination. The heavy-chain locus includes an array of 51 V 
segments, 27 D segments, and 6 J segments. Gene rearrangement begins with D-J joining, 
followed by further rearrangement to link the V segment to the DJ segment. 


33.4.2 More Than 10^ Antibodies Can Be Formed 
by Combinatorial Association and Somatic Mutation 

Let us recapitulate the sources of antibody diversity. The germ line contains 
a rather large repertoire of variable-region genes. For k light chains, there 
are about 40 V-segment genes and five J-segment genes. Hence, a 
total of 40 X 5 = 200 kindsofcompleteVK genes can be formed by the com- 
binations of V and J. A similar analysis suggests that at least 120 different 
A light chains can be generated. A larger number of heavy-chain genes can 
be formed because of the role of the D segments. For 51 V, 27 D, and 6 J 
gene segments, the number of complete Vh genes that can be formed is 8262. 
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FIGURE 33.18 B-cell receptor. This 
complex consists of a membrane-bound 
IgM molecule noncovalentiy bound to two 
Ig-a-lg-p heterodimers. The intracellular 
domains of each of the Ig-a and Ig-p 
chains include an immunoreceptor 
tyrosine-based activation motif (ITAM). 



Kinase 

domain 


Phosporylates targets including an inhibitor 
of the transcription factor NF-kB 


FIGURE 33.19 B-cell activation. The binding of multivalent 
antigen such as bacterial or virai surfaces links membrane-bound 
IgM molecules. This oligomerization triggers the phosphorylation 
of tyrosine residues in the ITAM sequences by protein tyrosine 
kinases such as Lyn. After phosphorylation, the ITAMs serve as 
docking sites for Syk, a protein kinase that phosphorylates a 
number of targets, including transcription factors. 


The association of 320 kinds of L chains with 8262 kinds of H chains would 
yield 2.6 X 10^ different antibodies. Variability in the exact points of seg- 
ment joining and other mechanisms increases this value by at least two or- 
ders of magnitude. 

Even more diversity is introduced into antibody chains by somatic 
mutation —that is, the introduction of mutations into the recombined genes. 
In fact, a 1000-fold increase in binding affinity is seen in the course of a 
typical humoral immune response, arising from somatic mutation, a process 
called affinity maturation. The generation of an expanded repertoire leads to 
the selection of antibodies that more precisely fit the antigen. Thus, nature 
draws on each of three sources of diversity—a germ-line repertoire, somatic 
recombination, and somatic mutation—to form the rich variety of anti¬ 
bodies that protect an organism from foreign incursions. 


33.4.3 The Oligomerization of Antibodies Expressed on the 
Surface of Immature B Cells Triggers Antibody Secretion 

The processes heretofore described generate a highly diverse set of antibody 
molecules—a key first step in the generation of an immune response. The 
next stage is the selection of a particular set of antibodies directed against a 
specific invader. How does this selection occur? Each immature B cell, pro- 
duced in the bone marrow, expresses a monomeric form of IgM attached 
to its surface (Figure 33.18). Each cell expresses approximately 10^ IgM 
molecules, but all of these molecules are identical in amino acid sequence and, 
hence, in antigen-binding specificity. Thus, the selection of a particular im¬ 
mature B cell for growth will lead to the amplification of an antibody with 
a unique specificity. The selection process begins with the binding of an 
antigen to the membrane-bound antibody. 

Associated with each membrane-linked IgM molecule are two molecules 
of a heterodimeric membrane protein called Ig-a-Ig-|3 (see Figure 33.18). 
Examination of the amino acid sequences of Ig-a and Ig-|3 is highly in- 
structive. The amino terminus of each protein lies outside the cell and cor- 
responds to a single immunoglobulin, and the carboxyl 
terminus, which lies inside the cell, includes a sequence 
of 18 amino acids called an immunoreceptor tyrosine-based 
activation motif (ITAM) (see Figure 33.18). As its name 
suggests, each ITAM includes key tyrosine residues, 
which are subject to phosphorylation by particular pro¬ 
tein kinases present in immune-system cells. 

A fundamental observation with regard to the mech- 
anism by which the binding of antigen to membrane- 
bound antibody triggers the subsequent steps of the im¬ 
mune response is that oligomerization or clustering of the 
antibody molecules is required (Figure 33.19). The re- 
quirement for oligomerization is reminiscent of the 
dimerization of receptors triggered by growth hormone 
and epidermal growth factor encountered in Sections 15.4 
and 15.4.1; indeed, the associated signaling mechanisms 
appear to be quite similar. The oligomerization of the 
membrane-bound antibodies results in the phosphoryla¬ 
tion of the tyrosine residues within the ITAMs by pro¬ 
tein tyrosine kinases including Lyn, a homolog of Src 
(Section 15.5). The phosphorylated ITAMs serve as 
docking sites for a protein kinase termed spleen tyrosine 
feinase (Syk), which has two SH2 domains that interact 
with the pair of phosphorylated tyrosines in each ITAM. 
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Syk, when activated by phosphorylation, proceeds to phosphorylate other 
signal-transduction proteins including an inhibitory subunit of a transcrip- 
tion factor called NF-kB and an isoform of phospholipase C. The signaling 
processes continue downstream to activate gene expression, leading to the 
stimulation of cell growth and initiating further B-cell differentiation. 

T Drugs that modulate the immune system have served as sources, 
of insight into immune-system signaling pathways. For example, 
cyclosporin, a powerful suppressor of the immune system, acts by blocking 
a phosphatase called calcineurin, which normally activates a transcription 
factor called NF-AT by dephosphorylating it. 



The potent immune supression that results reveals how crucial the activity 
of this transcription factor is to the development of an immune response. 
Without drugs such as cyclosporin, organ transplantation would be ex- 
tremely difficult because transplanted tissue expresses a wide range of for- 
eign antigens, which causes the immune system to reject the new tissue. 

The role of oligomerization in the B-cell signaling pathway is illuminated 
when we consider the nature of many antigens presented by pathogens. The 
surfaces of many viruses, bactéria, and parasites are characterized by arrays 
of identical membrane proteins or membrane-linked carbohydrates. Thus, 
most pathogens present multiple binding surfaces that will naturally cause 
membrane-associated antibodies to oligomerize as they bind adjacent epi- 
topes. In addition, the mechanism accounts for the observation that most 
small molecules do not induce an immune response; however, coupling mul¬ 
tiple copies of the small molecule to a large oligomeric protein such as key- 
hole limpet hemocyanin (KLH), which has a molecular mass of dose to 
1 million daltons or more, promotes antibody oligomerization and, hence, 
the production of antibodies against the small-molecule epitope. The large 
protein is called the carrier of the attached Chemical group, which is called a 
haptenic determinant. The small foreign molecule by itself is called a hapten. 
Antibodies elicited by attached haptens will bind unattached haptens as well. 

33.4.4 Different Classes of Antibodies Are Formed by the 
Hopping of Vh Genes 

The development of an effective antibody-based immune response depends 
on the secretion into the blood of antibodies that have appropriate effector 
functions. At the beginning of this response, an alternative mRNA splicing 
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FIGURE 33.20 Class swítchíng. Further 
rearrangement of the heavy-chain locus 
resuits in the generation of genes for anti- 
body classes other than IgM. In the case 
shown, rearrangement places the VDJ 
region next to the C 7 I region, resulting 
in the production of IgGl. Note that no 
further rearrangement of the VDJ region 
takes place, so the specificity of the anti- 
body is not affected. 


pathway is activated so that the production of membrane-linked IgM is sup- 
planted by the synthesis of secreted IgM. As noted in Section 33.1, secreted 
IgM is pentameric and has a relatively high avidity for multivalent antigens. 
Later, the antibody-producing cell makes either IgG, IgA, IgD, or IgE of 
the same specificity as the intially secreted IgM. In this switch, the light 
chain is unchanged, as is the variable region of the heavy chain. Only the 
constant region of the heavy chain changes. This step in the differentiation 
of an antibody-producing cell is called class switching (Figure 33.20). In un- 
differentiated cells, the genes for the constant region of each class of heavy 
chain, called Cjj^, C 5 , and 0^, are next to each other. There are eight 

in all, including four genes for the constant regions of 7 chains. A complete 
gene for the heavy chains of IgM antibody is formed by the translocation 
of a Vh gene segment to a DJh gene segment. 

How are other heavy chains formed? Class switching is mediated by a 
gene-rearrangement process that moves a VDJ gene from a site near one C 
gene to a site near another C gene. Importantly, the antigen-binding speci¬ 
ficity is conserved in class switching because the entire V^DJ^ gene is translo- 
cated in an intact form. For example, the antigen-combining specificity of 
IgA produced by a particular cell is the same as that of IgM synthesized at 
an earlier stage of its development. The biological significance of Ch switch¬ 
ing is that a whole recognition domain (the variable domain) is shifted from 
the early constant region (Cjj^) to one of several other constant regions that 
mediate different effector functions. 


33.5 MAJOR-HISTOCOIVIPATIBILITY-COIVIPLEX PROTEINS 
PRESENT PEPTIDE ANTIGENS ON CELL SURFACES 
FOR RECOGNITION BY T-CELL RECEPTORS 

Soluble antibodies are highly effective against extracellular pathogens, but 
they confer little protection against microorganisms that are predominantly 
intracellular, such as viruses and mycobacteria (which cause tuberculosis and 
leprosy). These pathogens are shielded from antibodies by the host-cell 
membrane (Figure 33.21). A different and more subtle strategy, cell-mediated 


FIGURE 33.21 Intracellular pathogen. 

An electron micrograph showing 
mycobacteria (arrows) inside an infected 
macrophage. [Courtesy of Dr. Stanley Falkow.] 
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Major Histocompatibility Complex 


FIGURE 33.22 Presentation of peptides 
from cytosolíc proteins. Class I MHC 
proteins on the surfaces of most cells 
display peptides that are derived from 
cytosolic proteins by proteolysis. 


immunity, evolved to cope with intracellular pathogens. T ceWs continually 
scan the surfaces of all cells and kill those that exhibit foreign markings. The 
task is not simple; intracellular microorganisms are not so obliging as to in- 
tentionally leave telltale traces on the surface of their host. Quite the con- 
trary, successful pathogens are masters of the art of camouflage. Vertebrates 
have devised an ingenious mechanism—cut and display—to reveal the 
presence of stealthy intruders. Nearly all vertebrate cells exhibit on their 
surfaces a sample of peptides derived from the digestion of proteins in their 
cytosol. These peptides are displayed by integral membrane proteins that 
are encoded by the major histocompatibility complex (MHC). Specifically, 
peptides derived from cytosolic proteins are bound to class I MHC proteins. 

How are these peptides generated and deli ver ed to the plasma mem¬ 
brane? The process starts in the cytosol with the degradation of proteins, 
self proteins as well as those of pathogens (Figure 33.22). Digestion is car- 
ried out by proteasomes (Section 23.2.2). The resulting peptide fragments 
are transported from the cytosol into the lumen of the endoplasmic reticu- 
lum by an ATP-driven pump. In the ER, peptides combine with nascent 
class I MHC proteins; these complexes are then targeted to the plasma 
membrane. 

MHC proteins embedded in the plasma membrane tenaciously grip their 
bound peptides so that they can be touched and scrutinized by T-cell re- 
ceptors on the surface of a killer cell. Foreign peptides bound to class I MHC 
proteins signal that a cell is infected and mark it for destruction by cyto- 
toxic T cells. An assembly consisting of the foreign peptide—MHC com¬ 
plex, the T-cell receptor, and numerous accessory proteins triggers a Cas¬ 
cade that induces apoptosis in the infected cell. Strictly speaking, infected 
cells are not killed but, instead, are triggered to commit suicide to aid the 
organism. 

33.5.1 Peptides Presented by MHC Proteins Occupy a Deep 
Groove Flanked by Alpha Helices 

The three-dimensional structure of a large fragment of a human MHC 
class I protein, human leukocyte antigen A2 (HLA-A2), was solved in 1987 
by Don Wiley and Pamela Bjorkman. Class I MHC proteins consist of 
a 44-kd a chain noncovalently bound to a 12-kd polypeptide called ^ 2 - 
microglohulin. The a chain has three extracellular domains (a^, a 2 , and a 3 ), 
a transmembrane segment, and a tail that extends into the cytosol (Figure 
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FIGURE 33.23 Class I MHC 


proteín. A protein of this class consists of 
two chains. The a chain begins with two 
domains that include a helices (a-i, ^ 2 ), 
an imnnunoglobulin domain (a^), a 
transmembrane domain, and a cytoplasmic 
tail. The second chain, P 2 -íToicroglobulin, 
adopts an immunoglobulin fold. 


FIGURE 33.25 Anchor resídues. (A) The 

amino acid sequences of three peptides 
that bind to the class I MHC protein 
HLA-A2 are shown. Each of these peptides 
has leucine in the second position and 
valine in the carboxyl-terminal position. 

(B) Comparison of the structures of these 
peptides reveals that the amino and 
carboxyl termini as well as the side chains 
of the leucine and valine residues are in 
essentially the same position in each 
peptide, whereas the remainder of the 
structures are quite different. 


Top view 


Side view 



FIGURE 33.24 Class I MHC peptide-binding site. The and ^2 domains come 
together to form a groove in which peptides are displayed. The two views shown reveal 
that the peptide is surrounded on three sides by a p sheet and two a helices, but it is 
accessible from the top of the structure. 


33.23). Cleavage by papain of the HL A a chain several residues before the 
transmembrane segment yielded a soluble heterodimeric fragment. The ^ 2 - 
microglobulin subunit and the domains have immunoglobulin folds, al- 
though the pairing of the two domains differs from that in antibodies. The 

and a 2 domains exhibit a novel and remarkable architecture. They asso- 
ciate intimately to form a deep groove that serves as the peptide-binding 
site (Figure 33.24). The floor of the groove, which is about 25 A long and 
10 A wide, is formed by eight |3 strands, four from each domain. A long he- 
lix contributed by the domain forms one side, and a helix contributed by 
the a 2 domain forms the other side. This groove is the binding site for the 
presentation of peptides. 

The groove can be filled by a peptide from 8 to 10 residues long in an 
extended conformation. As we shall see (Section 33.5.6), MHC proteins are 
remarkably diverse in the human population; each person expresses as many 
as six distinct class I MHC proteins and many different forms are present 
in different people. The first structure determined, HLA-A2, binds pep¬ 
tides that almost always have leucine in the second position and valine in 
the last position (Figure 33.25). Side chains from the MHC molecule in- 
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teract with the amino and carboxyl termini and with the side chains in these 
two key positions. These residues are often referred to as the anchor residues. 
The other residues are highly variable. Thus, many millions of different 
peptides can be presented by this particular class I MHC protein; the iden- 
tities of only two of the nine residues are crucial for binding. Each class of 
MHC molecules requires a unique set of anchor residues. Thus, a tremen- 
dous range of peptides can be presented by these molecules. Note that one 
face of the bound peptide is exposed to solution vuhere it can be examined by 
other molecules, particularly T-cell receptors. An additional remarkable fea- 
ture of MHC—peptide complexes is their kinetic stability; once bound, a 
peptide is not released, even over a period of days. 

33.5.2 T-Cell Receptors Are Antibody-like Proteins Containing 
Variable and Constant Regions 

We are now ready to consider the receptor that recognizes peptides dis- 
played by MHC proteins on target cells. The T-cell receptor consists of a 
43-kd a chain (To,) joined by a disulfide bond to a 43-kd |3 chain (Tp; Figure 
33.26). Each chain spans the plasma membrane and has a short carboxyl- 
terminal region on the cytosolic side. A small proportion of T cells express 
a receptor consisting of 7 and 8 chains in place of a and (3. T^ and Tp, like 
immunoglobulin L and H chains, consist of variable and constant regions. 
Indeed, these domains of the T-cell receptor are homologous to the V and C 
domains of immunoglobulins. Furthermore, hypervariable sequences present 
in the V regions of T^ and Tp form the binding site for the epitope. 

The genetic architecture of these proteins is similar to that of im¬ 
munoglobulins. The variable region of T^ is encoded by about 50 V- 
segment genes and 70 J-segment genes. Tp is encoded by two D-segment 
genes in addition to 57 V and 13 J-segment genes. Again, the diversity of 
component genes and the use of slightly imprecise modes of joining them 
increase the number of distinct proteins formed. At least 
10^^ different specificities could arisefrom combinations of 
this repertoire of genes. Thus, T-cell receptors, like im¬ 
munoglobulins, can recognize a very large number of 
different epitopes. All the receptors on a particular T 
cell have the same specificity. 

How do T cells recognize their targets? The variable 
regions of the a and |3 chains of the T-cell receptor form 
a binding site that recognizes a combined epitope-for- 
eign peptide bound to an MHC protein (Figure 33.27). 

Neither the foreign peptide alone nor the MHC protein 
alone forms a complex with the T-cell receptor. Thus, 
fragments of an intracellular pathogen are presented in 
a context that allows them to be detected, leading to the 
initiation of an appropriate response. 

33.5.3 CD8 on Cytotoxic T Cells Acts in Concert 
with T-Cell Receptors 

The T-cell receptor does not act alone in recognizing and 
mediating the fate of target cells. Cytotoxic T cells also 
express a protein termed CD8 on their surfaces that is 
crucial for the recognition of the class I MHC-peptide 
complex. The abbreviation CD stands for cluster of differ- 
entiation, referring to a cell-surface marker that is used to 
identify a lineage or stage of differentiation. Antibodies 
specific for particular CD proteins have been invaluable 
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FIGURE 33.26 T-cell receptor. 

This protein consists of an a chain and a 
(3 chain, each of which consists of two 
immunoglobulin domains and a membrane- 
spanning domain. The two chains are 
linked by a disulfide bond. 


T-cell receptor 



T-cell receptor 

(bottom view) 



MHC class I 

(top view) 


MHC class I 


FIGURE 33.27 T-cell receptor-Class I MHC complex. 

The T-cell receptor binds to a class I MHC protein containing a 
bound peptide. The T-cell receptor contacts both the MHC protein 
and the peptide as shown by surfaces exposed when the complex 
is separated (right). These surfaces are colored according to the 
chain that they contact. 
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FIGURE 33.28 The coreceptor 
CD8. This dimeric protein extends from the 
surface of a cytotoxic T cell and binds to 
class I MHC molecules that are expressed 
on the surface of the cell that is bound to 
the T cell. The dashed lines represent 
extended polypeptide chains that link the 
immunoglobulin domain of CD 8 to the 
membrane. 


p a 



FIGURE 33.29 T-cell receptor complex. 

The T-cell receptor is associated with 
six CD3 molecules: a CD3-7-CD3-e 
heterodimer, a CD3-8-CD3-e heterodimer, 
and two chains of CD3-^. Single ITAM 
sequences are present in the cytoplasmic 
domains of CD 3 - 7 , CD3-8, and CD3-e 
whereas three such sequences are found 
in each CD3-^ chain. 


in folio wing the development of leukocytes and in disco ver ing new interac- 
tions between specific cell types. 

Each chain in the CD 8 dimer contains a domain that resembles an im¬ 
munoglobulin variable domain (Figure 33.28). CD 8 interacts primarily with 
the relatively constant domain of class I MHC proteins. This interaction 
further stabilizes the interactions between the T cell and its target. The cy- 
tosolic tail of CD 8 contains a docking site for Lck, a cytosolic tyrosine ki- 
nase akin to Src. The T-cell receptor itself is associated with six polypep- 
tides that form the CD3 complex (Figure 33.29). The 7 , 8 , and e chains of 
CD3 are homologous to Ig-a and Ig-|3 associated with the B-cell receptor 
(Section 33.4.3); each chain consists of an extracellular immunoglobulin do¬ 
main and an intracellular ITAM region. These chains associate into CD3 
ye and CD3 8 e heterodimers. An additional component, the CD3 ^ chain, 
has only a small extracellular domain and a larger intracellular domain con- 
taining three ITAM sequences. 

On the basis of these components, a model for T-cell activation can be 
envisaged that is closely parallel to the pathway for B-cell activation (Section 
33.3; Figure 33.30). The binding of the T-cell receptor with the class 
I MHC-peptide complex and the concomitant binding of CD 8 from the 
T-cell with the MHC molecule results in the association of the kinase Lck 
with the ITAM substrates of the components of the CD3 complex. Phos- 
phorylation of the tyrosine residues in the ITAM sequences generates dock¬ 
ing sites for a protein kinase called ZAP-70 (for 70-kd 2 :eta-associated 
protein) that is homologous to Syk in B cells. Docked by its two SH 2 
domains, ZAP-70 phosphorylates downstream targets in the signaling Cas¬ 
cade. Additional molecules, including a membrane-bound protein phos- 
phatase called CD45 and a cell-surface protein called CD28, play ancillary 
roles in this process. 



Kinase 

donnain 


FIGURE 33.30 T-cell activation. The interaction between the T-cell receptor and a class I 
MHC-peptide complex results in the binding of CD 8 to the MHC protein, the recruitment of 
the protein tyrosine kinase Lck, and the phosphoylation of tyrosine residues in the ITAM 
sequences of the CD3 chains. After phosphorylation, the ITAM regions serve as docking sites 
for the protein kinase ZAP-70, which phosphoylates protein targets to transmit the signal. 
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T-cell activation has two important consequences. First, the activation 
of cytotoxic T cells results in the secretion of perforin. This 70-kd protein 
makes the cell membrane of the target cell permeable by polymerizing to 
form transmembrane pores 10 nm wide (Figure 33.31). The cytotoxic T cell 
then secretes proteases called granzymes into the target cell. These enzymes 
initiate the pathway of apoptosis (Section 18.6.6), leading to the death of 
the target cell and the fragmentation of its DNA, including any virai DNA 
that may be present. Second, after it has stimulated its target cell to com- 
mit suicide, the activated T cell disengages and is stimulated to reproduce. 
Thus, additional T cells that express the same T-cell receptor are generated 
to continue the battle against the invader after these T cells have been iden- 
tified as a suitable weapon. 

33.5.4 Helper T Cells Stimulate Cells That Display Foreign 
Peptides Bound to Class II MHC Proteins 

Not all T cells are cytotoxic. Helper T cells, a different class, stimulate the pro- 
liferation of specific B lymphocytes and cytotoxic T cells and thereby serve as 
partners in determining the immune responses that are produced. The impor- 
tance of helper T cells is graphically revealed by the devastation wrought 
by AIDS, a condition that destroys these cells. Helper T cells, like cytotoxic 
T cells, detect foreign peptides that are presented on cell surfaces by MHC 
proteins. However, the source of the peptides, the MHC proteins that bind 
them, and the transport pathway are different. 

Helper T cells recognize peptides bound to MHC molecules referred 
to as class II. Their helping action is focused on B cells, macrophages, and 
dendritic cells. Class II MHC proteins are expressed only by these antigen- 
presenting cells, unlike class I MHC proteins, which are expressed on nearly 
all cells. The peptides presented by class II MHC proteins do not come from 
the cytosol. Rather, they arisefrom the degradation of proteins that have been 
internalized by endocytosis. Consider, for example, a virus particle that is cap- 
tured by membrane-bound immunoglobulins on the surface of a B cell 
(Figure 33.32). This complex is delivered to an endosome, a membrane- 
enclosed acidic compartment, where it is digested. The resulting peptides 
become associated with class II MHC proteins, which move to the cell 
surface. Peptides from the cytosol cannot reach class II proteins, whereas 
peptides from endosomal compartments cannot reach class I proteins. This 
segregation of displayed peptides is biologically criticai. The association of 
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FIGURE 33.31 Consequences of 
cytotoxic-T-cell action. An electron 
micrograph showing pores in the 
membrane of a cell that has been 
attacked by a cytotoxic T cell. The pores 
are formed by the polymerization of 
perforin, a protein secreted by the cytotoxic 
T cell. [Courtesy of Dr. Eckhard Podock.] 


Peptides bound to 
class II MHC proteins 



FIGURE 33.32 Presentation of peptides 
from internalized proteins. Antigen- 
presenting cells bind and internalize foreign 
proteins and display peptides that are 
formed from the digestion of these proteins 
in Class II MHC proteins. 
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FIGURE 33.34 Coreceptor CD4. 

This protein comprises four tandem 
immunoglobulin domains that extend 
from the surface of helper T cells. 


a foreign peptide with a class II MHC protein signals that a cell has en- 
countered a pathogen and serves as a call for help. In contrast, association with 
a class I MHC protein signals that a cell has succumbed to a pathogen and 
is a call for destruction. 

33.5.5 Helper T Cells Rely on the T-Cell Receptor and CD4 
to Recognize Foreign Peptides on Antigen-Presenting Cells 

The overall structure of a class II MHC molecule is remarkably similar to 
that of a class I molecule. Class II molecules consist of a 33-kd a chain and 
a noncovalently bound 30-kd |3 chain (Figure 33.33). 


Side view 


Top view 



Peptide 



FIGURE 33.33 Class II MHC protein. A class II MHC protein consists of homologous 
a and p chains, each of which has an amino-terminal domain that constitutes half of the 
peptide-binding structure, as well as a carboxyl-terminal immunoglobulin domain. The 
peptide-binding site is similar to that in class I MHC proteins except that it is open at both 
ends, allowing class II MHC proteins to bind longer peptides than those bound by class I. 


Each contains two extracellular domains, a transmembrane segment, and a 
short cytosolic tail. The peptide-binding site is formed by the and Pi do¬ 
mains, each of which contributes a long helix and part of a P sheet. Thus, 
the same structural elements are present in class I and class II MHC mol¬ 
ecules, but they are combined into polypeptide chains in different ways. 
Class II MHC molecules appear to form stable dimers, unlike class I mol¬ 
ecules, which are monomeric. The peptide-binding site of a class II mole¬ 
cule is open at both ends, and so this groove can accommodate longer pep¬ 
tides than can be bound by class I molecules; typically, peptides between 
13 and 18 residues long are bound. The peptide-binding specificity of each 
class II molecule depends on binding pockets that recognize particular 
amino acids in specific positions along the sequence. 

Helper T cells express T-cell receptors that are produced from the same 
genes as those on cytotoxic T cells. These T-cell receptors interact with class 
II MHC molecules in a manner that is analogous to T-cell-receptor inter- 
action with class I MHC molecules. Nonetheless, helper T cells and cyto¬ 
toxic T cells are distinguished by other proteins that they express on their 
surfaces. In particular, helper T cells express a protein called CD4 instead 
of expressing CD8. CD4 consists of four immunoglobulin domains that ex¬ 
tend from the T-cell surface, as well as a small cytoplasmic region (Figure 
33.34). The amino-terminal immunoglobulin domains of CD4 interact with 
the base on the class II MHC molecule. Thus, helper T cells bind cells 
expressing class II MHC specifically because of the interactions with CD4 
(Figure 33.35). 
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(A) Cell presenting cytoplasmically 
derived foreign peptide in class I MHC 



(B) 


Antigen-presenting cell 




FIGURE 33.35 Variations on a theme. (A) Cytotoxic T cells recognize foreign peptides 
presented in class I MHC proteins with the aid of the coreceptor CD8. (B) Helper T cells 
recognize peptides presented in class II MHC proteins by specialized antigen-presenting 
cells with the aid of the coreceptor CD4. 


When a helper T cell binds to an antigen-presenting cell expressmg an 
appropriate class II MHC-peptide complex, signaling pathways analogous 
to those in cytotoxic T cells are initiated by the action of the kinase Lck on 
ITAMs in the CD3 molecules associated with the T-cell receptor. How- 


ever, rather than triggering events leading to the death of 
the attached cell, these signaling pathways result in the se- 
cretion of cytokines from the helper cell. Cytokines are a 
family of molecules that include, among others, inter- 
leukin-2 and interferon-y. Cytokines bind to specific re- 
ceptors on the antigen-presenting cell and stimulate 
growth, differentiation, and in regard to plasma cells, 
which are derived from B cells, antibody secretion 
(Figure 33.36). Thus, the internalization and presenta- 
tion of parts of a foreign pathogen help to generate a lo¬ 
cal environment in which cells taking part in the defense 
against this pathogen can flourish through the action of 
helper T cells. 

33.5.6 MHC Proteins Are Highly Diverse 

MHC class I and II proteins, the presenters of 
peptides to T cells, were discovered because of 
their role in transplantation rejection. A tissue trans- 
planted from one person to another or from one mouse to 
another is usually rejected by the immune system. In con- 
trast, tissues transplanted from one identical twin to an¬ 
other or between mice of an inbred strain are accepted. 
Genetic analyses revealed that rejection occurs when tis¬ 
sues are transplanted between individuais having differ- 
ent genes in the major histocompatibility complex, a clus- 
ter of more than 7 5 genes playing key roles in immunity. 
The 3500-kb span of the MHC is nearly the length of the 


Antigen-presenting cell 



Kinase 

donnain 


FIGURE 33.36 Helper T cell action. The engagement of the 
T-cell receptor in helper T cells resuits in the secretion of cytokines. 
These cytokines bind to cytokine receptors expressed on the 
surface of the antigen-presenting cell, stimulating cell growth, 
differentiation, and, in regard to a B cell, antibody secretion. 
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33.37 

class I MHC proteins. The positions of 
sites with a high degree of polymorphism 
in the human population are displayed as 
red spheres on the structure of the amino- 
terminal part of a class I MHC protein. 



FIGURE 33.38 Human ímmunodefídency 
vírus. A schematic diagram of HIV reveals 
its proteins and nucleic acid components. 
The membrane-envelope glycoproteins 
gp41 and gpl20 are shown in dark and 
light green. The virai RNA is shown in red, 
and molecules of reverse transcriptase are 
shown in blue. [After R. C. Gallo. The AIDS virus. 
Copyright © 1987 by Scientific American, Inc. All 
rights reserved.] 


entire E. coli chromosome. The MHC encodes class I proteins (presen- 
ters to cytotoxic T cells) and class II proteins (presenters to helper T 
cells), as well as class III proteins (components of the complement Cas¬ 
cade) and many other proteins that play key roles in immunity. 

Human beings express six different class I genes (three from each par- 
ent) and six different class II genes. The three loci for class I genes are called 
HLA-A, -B, and -C; those for class II genes are called HLA-DP, -DQ, and 
-DR. These loci are highly polymorphic: many alleles of each are present in 
the population. For example, more than 50 each of HLA-A, -B, and -C al¬ 
leles are known; the numbers discovered increase each year. Hence, the like- 
lihood that two unrelated per sons have identical class I and II proteins is 
very small (<10~"^), accounting for transplantation rejection unless the 
genotypes of donor and acceptor are closely matched in advance. 

Differences between class I proteins are located mainly in the ai and a 2 
domains, which form the peptide-binding site (Figure 33.37). The do- 
main, which interacts with a constant |32-microglobulin is largely conserved. 
Similarly, the differences between class II proteins cluster near the peptide- 
binding groove. Why are MHC proteins so highly variable? Their diversity 
makes possible the presentation of a very wide range of peptides to T cells. A 
particular class I or class II molecule may not be able to bind any of the pep- 
tide fragments of a virai protein. The likelihood of a fit is markedly increased 
by having several kinds (usually six) of each class of presenters in each 
individual. If all members of a species had identical class I or class II mol¬ 
ecules, the population would be much more vulnerable to devastation by a 
pathogen that had evolved to evade presentation. The evolution of the di- 
verse human MHC repertoire has been driven by the selection for individ¬ 
ual members of the species who resist infections to which other members 
of the population may be susceptible. 

33.5.7 Human ímmunodefídency Viruses Subvert the Immune 
System by Destroying Helper T Cells 

In 1981, the first cases of a new disease now called acquired immune defi- 
ciency syndrome (AIDS) were recognized. The victims died of rare infections 
because their immune systems were crippled. The cause was identified two 
years later by Luc Montagnier and coworkers. AIDS is produced by human 
Ímmunodefídency virus (HIV), of which two major classes are known: HIV- 
1 and the much less common HIV-2. Like other retroviruses, HIV contains 
a single-stranded RNA genome that is replicated through a double-stranded 
DNA intermediate. This virai DNA becomes integrated into the genome 
of the host cell. In fact, virai genes are transcribed only when they are inte¬ 
grated into the host DNA. 

The HIV virion is enveloped by a lipid bilayer membrane containing 
two glycoproteins: gp41 spans the membrane and is associated with gpl20, 
which is located on the externai face (Figure 33.38). The core of the virus 
contains two copies of the RNA genome and associated transfer RNAs, and 
several molecules of reverse transcriptase. They are surrounded by many 
copies of two proteins called pl8 and p24. The host cell for HIV is the helper 
T cell. The gpl20 molecules on the membrane of HIV bind to CD4 
molecules on the surface of the helper T cell (Figure 33.39). This interac- 
tion allows the associated virai gp41 to insert its amino-terminal head into 
the host-cell membrane. The virai membrane and the helper-cell membrane 
fuse, and the virai core is released directly into the cytosol. Infection by 
HIV leads to the destruction of helper T cells because the permeability of 
the host plasma membrane is markedly increased by the insertion of virai 




















7552dc33_921-950 


1/12/02 9:28 AM Page 943 mac76 mac76:385_ri 



FIGURE 33.39 HIV receptor. A complex between a modified form of the envelope 
glycoprotein gpl20 from HIV and a peptide corresponding to the two amino-terminal domains 
from the helper T-cell protein CD4 reveals how virai infection of helper T cells is initiated. 
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glycoproteins and the budding of virus particles. The influx of ions and wa- 
ter disrupts the ionic balance, causing osmotic lysis. 

T The development of an effective AIDS vaccine is difficult owing 
to the antigenic diversity of HIV strains. Because its mechanism for 
replication is quite error prone, a population of HIV presents an ever- 
changing array of coat proteins. Indeed, the mutation rate of HIV is more 
than 65 times as high as that of influenza virus. A major aim now is to de¬ 
fine relatively conserved sequences in these HIV proteins and use them as 
immunogens. 

33.6 IMIVIUNE RESPONSES AGAINST SELF-ANTIGENS 
ARE SUPPRESSED 

The primary function of the immune system is to protect the host from in- 
vasion by foreign organisms. But how does the immune system avoid 
mounting attacks against the host organism? In other words, how does the 
immune system distinguish between self and nonself ? Clearly, proteins from 
the organism itself do not bear some special tag identifying them. Instead, 
selection processes early in the developmental pathways for immune cells 
kill or suppress those immune cells that react strongly with self-antigens. 
The evolutionary paradigm still applies; immune cells that recognize self- 
antigens are generated, but selective mechanisms eliminate such cells in the 
course of development. 

33.6.1 T Cells Are Subject to Positive and Negative Selection 
in the Thymus 

T cells derive their name from the location of their production —the 
thymus, a small organ situated just abo ve the heart. Examination of the de¬ 
velopmental pathways leading to the production of mature cytotoxic and 
helper T cells reveals the selection mechanisms that are crucial for distin- 
guishing self from nonself. These selection criteria are quite stringent; ap- 
proximately 98% of the thymocytes, the precursors of T cells, die before the 
completion of the maturation process. 

Thymocytes produced in the bone marrow do not express the T-cell- 
receptor complex, CD4, or CD8. On relocation to the thymus and re- 
arrangement of the T-cell-receptor genes, the immature thymocyte expresses 
all of these molecules. These cells are first subjected to positive selection 
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Cytotoxic T cell 

(CD8 positive) 


Negative selection 


Cells that bind strongly to 
MHC or MHC-self-peptide 
complexes are eliminated 


Helper T cell 

(CD4 positive) 



FIGURE 33.40 T-cell selection. A population of thymocytes is subjected first to positive 
selection to remove cells that express T-cell receptors that wili not bind to MHC proteins 
expressed by the individual organism. The surviving cells are then subjected to negative 
selection to remove cells that bind strongly to MHC complexes bound to self-peptides. 





33.6.2 Autoimmune Diseases Result 
from the Generation of Immune Responses 
Against Self-Antigens 


(Figure 33.40). Cells for which the T-cell receptor can bind with reasonable 
affinity to either class I or class II MHC molecules survive this selection; 
those for which the T-cell receptor does not participate in such an interac- 
tion undergo apoptosis and die. The affinities of interaction required to pass 
this selection are relatively modest, and so contacts be- 
tween the T-cell receptor and the MHC molecules them- 
selves are sufficient without any significant contribution 
from the bound peptides (which will be derived from pro¬ 
teins in the thymus). The role of the positive selection step 
is to prevent the production of T cells that will not bind to 
any MHC complex present, regardless of thepeptide bound. 

The cell population that survives positive selection is 
subjected to a second step, negative selection. Here, T cells 
that bind with high affinity to MHC complexes bound 
to self-peptides expressed on the surfaces of antigen-pre- 
senting cells in the thymus undergo apoptosis or are oth- 
erwise suppressed. Those that do not bind too avidly to 
any such MHC complex complete development and be- 
come mature cytotoxic T cells (which express only CD8) 
or helper T cells (which express only CD4). The nega¬ 
tive selection step leads to self tolerance; cells that bind an 
MHC-self-peptide complex are removed from the T-cell 
population. Similar mechanisms apply to developing 
B cells, suppressing B cells that express antibodies that 
interact strongly with self-antigens. 


FIGURE 33.41 Consequences of autoímmuníty. Photo- 
micrographs of an islet of Langerhans (A) in the pancreas of a 
normal mouse and (B) in the pancreas of a mouse with an 
immune response against pancreatic (3 cells, which resuits in a 
disease resembling insulin-dependent diabetes meilitus in human 
beings. [From M. A. Atkinson and N. K. Maclaren. What causes diabetes? 
Copyright © 1990 by Scientific American, Inc. All rights reserved.] 


T Although thymic selection is remarkably efficient in 
suppressing the immune response to self-antigens, 
failures do occur. Such failures resuits in autoimmune dis¬ 
eases. These diseases include relatively common illnesses 
such as insulin-dependent diabetes meilitus, multiple 
sclerosis, and rheumatoid arthritis. In these illnesses, im¬ 
mune responses against self-antigens result in damage to 
selective tissues that express the antigen (Figure 33.41). 
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In many cases, the cause of the generation of self-reactive antibodies or 
T cells in unclear. However, in other cases, infectious organisms such as 
bactéria or viruses may play a role. Infection leads to the generation of an¬ 
tibodies and T cells that react with many different epitopes from the infec¬ 
tious organism. If one of these antigens closely resembles a self-antigen, an 
autoimmune response can result. For example, Streptococcus infections 
sometimes lead to rheumatic fever owing to the production of antibodies 
to streptococcal antigens that cross-react with exposed molecules in heart 
muscle. 

33.6.3 The Immune System Plays a Role in Câncer Prevention 

T The development of immune responses against proteins encoded 
by our own genomes can be beneficiai under some circumstances. 
Câncer cells have undergone significant changes that often result in the ex- 
pression of proteins that are not normally expressed. For example, the mu- 
tation of genes can generate proteins that do not correspond in amino acid 
sequence to any normal protein. Such proteins may be recognized as for- 
eign, and an immune response will be generated specifically against the cân¬ 
cer cell. Alternatively, câncer cells often produce proteins that are expressed 
during embryonic development but are not expressed or are expressed at 
very low leveis after birth. For example, a membrane glycoprotein protein 
called carcinoembryonic antigen (CEA) appears in the gasterointestinal cells 
of developing fetuses but is not normally expressed at significant leveis af¬ 
ter birth. More than 50% of patients with colorectal câncer have elevated 
serum leveis of CEA. Immune cells recognizing epitopes from such pro¬ 
teins will not be subject to negative selection and, hence, will be present in 
the adult immune repertoire. These cells may play a câncer surveillance role, 
killing cells that overexpress antigens such as CEA and preventing geneti- 
cally damaged cells from developing into tumors. 
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SUMMARY 


O To respond effectively to a vast array of pathogens, the immune system 
must be tremendously adaptable. Adaptation by the immune system 
follows the principies of evolution: an enormously diverse set of poten- 
tially useful proteins is generated; these proteins are then subjected to 
intense selection so that only cells that express useful proteins flourish 
and continue development, until an effective immune response to a spe- 
cific invader is generated. 

o Antibodies Possess Distinct Antigen-Binding and Effector Units 

The major immunoglobulin in the serum is immunoglobulin G. An IgG 
protein is a heterotetramer with two heavy chains and two light chains. 
Treatment of IgG molecules with proteases such as papain produces 
three fragments: two F^b fragments that retain antigen-binding activity 
and an F^ fragment that retains the ability to activate effector functions 
such as the initiation of the complement cascade. The Fab fragments in- 
clude the L chain and the amino-terminal half of the H chain; the Fc 
domain is a dimer consisting of the carboxyyl-terminal halves of two H 
chains. Five different classes of antibody—IgG, IgM, IgA, IgD, and 
IgE—differ in their heavy chains and, hence, in their effector functions. 

o The Immunoglobulin Fold Consists of a Beta-Sandwich Framework 
with Hypervariable Loops 

One particular protein fold is found in many of the key proteins of the 
immune system. The immunoglobulin fold consists of a pair of |3 sheets 
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that pack against one another, linked by a single disulfide bond. Loops 
projecting from one end of the structure form a binding surface that can 
be varied by changing the amino acid sequences within the loops. 
Domains with immunoglobulin folds are linked to form antibodies 
and other classes of proteins in the immune system including T-cell 
receptors. 

• Antibodies Bind Specific Molecules Through Their Hypervariable Loops 

Two chains come together to form the binding surface of an antibody. 
Three loops from each domain, the complementarity-determining re- 
gions, form an essentially continuous surface that can vary tremen- 
dously in shape, charge, and other characteristics to allow particular an¬ 
tibodies to bind to molecules ranging from small molecules to large 
protein surfaces. 

• Diversity Is Generated by Gene Rearrangements 

The tremendous diversity of the amino acid sequences of antibodies is 
generated by segmentai rearrangements of genes. For antibody k light 
chains, one of 40 variable regions is linked to one of five joining regions. 
The combined VJ unit is then linked to the constant region. Thousands 
of different genes can be generated in this manner. Similar arrays are 
rearranged to form the genes for the heavy chains, but an additional re¬ 
gion called the diversity region lies between the V and the J regions. The 
combination L and H chains, each obtained through such rearranged 
genes, can produce more than 10^ distinct antibodies. Different classes 
of antibodies are also generated by gene rearrangements that lead to class 
switching. Oligomerization of membrane-bound antibody molecules 
initiates a signal-transduction cascade inside B cells. Key steps in this 
signaling process include the phosphorylation of specific tyrosine 
residues in sequences termed immunoreceptor tyrosine-based activa- 
tion motifs (ITAMs), present in proteins that associate with the 
membrane-bound antibodies. 

• Major-Hístocompatibility-Complex Proteins Present Peptide Antigens 
on Cell Surfaces for Recognition by T-Cell Receptors 

Intracellular pathogens such as viruses and mycobacteria cannot be eas- 
ily detected. Intracellular proteins are constantly being cut into small 
peptides by proteasomes and displayed in class I major-histocompati- 
bility-complex proteins on cell surfaces. Such peptides lie in a groove 
defined by two helices in the class I MHC proteins. The combination 
of MHC protein and peptide can be bound by an appropriate T-cell re¬ 
ceptor. T-cell receptors resemble the antigen-binding domains of anti¬ 
bodies in structure, and diversity in T-cell-receptor sequence is gener¬ 
ated by V(D)J gene rearrangements. The T-cell receptor recognizes 
features of both the peptide and the MHC molecule that presents it. 
Cytotoxic T cells initiate apoptosis in cells to which they bind through 
T-cell receptor-class I MHC-peptide interactions aided by interactions 
with the coreceptor molecule CD 8. Helper T cells recognize peptides 
presented in class II MHC proteins, a distinct type of MHC protein 
expressed only on antigen-presenting cells such as B cells and 
macrophages. Helper T cells express the coreceptor CD4, rather than 
CD8. CD4 interacts with class II MHC proteins present on antigen- 
presenting cells. Signaling pathways, analogous to those in B cells, are 
initiated by interactions between MHC-peptide complexes and T-cell 
receptors and the CD8 and CD4 coreceptors. Human immunodefi- 
ciency virus damages the immune system by infecting cells that express 
CD4, such as helper T cells. 
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In principie, the immune system is capable of generating antibodies and Selected Readings 

T-cell receptors that bind to self-molecules; that is, molecules that are 

normally present in a healthy and uninfected individual organism. Se- 

lection mechanisms prevent such self-directed molecules from being ex- 

pressed at high leveis. The selection process includes both positive se- 

lection, to enrich the population of cells that express molecules that have 

the potential to bind foreign antigens in an appropriate context, and 

negative selection, which eliminates cells that express molecules with 

too high an affinity for self-antigens. Autoimmune diseases such as 

insulin-dependent diabetes mellitus can result from amplification of a 

response against a self-antigen. 
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1. Energetics and kinetics. Suppose that the dissociation con- 
stant of an Fab-hapten complex is 3X10 ^Mat25°G. 

(a) What is the standard free energy of binding? 

(b) Immunologists often speak of affinity (Ka)> the reciprocai of 
the dissociation constant, in comparing antibodies. What is the 
affinity of this Fab? 

(c) The rate constant of release of hapten from the complex is 
120 s What is the rate constant for association? What does 
the magnitude of this value imply about the extent of structural 
change in the antibody on binding hapten? 

2. Sugar niche. An antibody specific for dextran, a polysac- 
charide of glucose residues, was tested for its binding of glucose 
oligomers. Maximal binding affinity was obtained when the 
oligomer contained six glucose residues. How does the size of 
this site compare with that expected for the binding site on the 
surface of an antibody? 

3. A brilliant emitter. Certain naphthalene derivatives exhibit 
a weak yellow fluorescence when they are in a highly polar en- 
vironment (such as water) and an intense blue fluorescence when 
they are in a markedly nonpolar environment (such as hexane). 
The binding of e-dansyl-lysine to specific antibody is accompa- 
nied by a marked increase in its fluorescence intensity and a shift 
in color from yellow to blue. What does this finding reveal about 
the hapten-antibody complex? 

4. Avidity versus affinity. The standard free energy of binding 
of Fab derived from an antiviral IgG is —7 kcal moP^ ( — 29 kj 
moP^) at 25°C. 

(a) Calculate the dissociation constant of this interaction. 

(b) Predict the dissociation constant of the intact IgG, assum- 
ing that both combining sites of the antibody can interact with 
virai epitopes and that the free-energy cost of assuming a favor- 
able hinge angle is +3 kcal moP^ (12.6 kJ moP^). 

5. Miniantibody. The Fab fragment of an antibody molecule has 
essentially the same affinity for a monovalent hapten as does in¬ 
tact IgG. 

(a) What is the smallest unit of an antibody that can retain the 
specificity and binding affinity of the whole protein? 

(b) Design a compact single-chain protein that is likely to 
specifically bind antigen with high affinity. 

6. Turning on B cells. B lymphocytes, the precursors of plasma 
cells, are triggered to proliferate by the binding of multivalent 
antigens to receptors on their surfaces. The cell-surface recep- 
tors are transmembrane immunoglobulins. Univalent antigens, 
in contrast, do not activate B cells. 

(a) What do these findings reveal about the mechanism of B- 
cell activation? 

(b) How might antibodies be used to activate B cells? 

7. An ingenious cloning strategy. In the cloning of the gene for 
the a chain of the T-cell receptor, T-cell cDNAs were hybridized 
with B-cell mRNAs. What was the purpose of this hybridiza- 
tion step? Can the principie be applied generally? 


8. Instruction. Before the mechanism for generating antibody 
diversity had been established, a mechanism based on protein 
folding around an antigen was proposed, primarily by Linus 
Pauling. In this model, antibodies that had different specifici- 
ties had the same amino acid sequence but were folded in dif¬ 
ferent ways. Propose a test of this model. 

9. Dealing with nonsense. Cells, including immune cells, de¬ 
grade mRNA molecules in which no long open reading frame is 
present. The process is called nonsense-mediated RNA decay. 
Suggest a role for this process in immune cells. 

10. Crystallization. The proteolytic digestion of a population of 
IgG molecules isolated from human serum results in the genera- 
tion of Fab Fc fragments. Why do Fc fragments crystallize 
more easily than Fab fragments generated from such a population? 

11. Presentation. The amino acid sequence of a small protein is: 

MSRLASKNLIRSDHAGGLLQATYSAVSS- 

IKNTMSFGAWSNAALNDSRDA 

Predict the most likely peptide to be presented by the class I 
MHC molecule HLA-A2. 

Mechanism Problem 

12. Catalytic antibody. Antibody is generated against a transi- 
tion State for the hydrolysis of the following ester. 




Some of these antibodies catalyze the hydrolysis of the ester. 
What amino acid residue might you expect to find in the bind¬ 
ing site on the antibody? 

Chapter Integration Problem 

13. Signaling. Protein tyrosine phosphatases, such as the mole¬ 
cule CD45 expressed in both B cells and T cells, play important 
roles in activating such protein tyrosine kinases as Fyn and Lck, 
which are quite similar to Src. Suggest a mechanism for the ac¬ 
tivation of such protein kinases by the removal of a phosphate 
from a phosphotyrosine residue. 

Data Interpretation Problem 

14. Affinity maturation. A mouse is immunized with an 
oligomeric human protein. Shortly after immunization, a cell line 
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that expresses a single type of antibody molecule (antibody A) 
is derived. The ability of antibody A to bind the human protein 
is assayed with the results shown in the adjoining graph. After 
repeated immunizations with the same protein, another cell line 
is derived that expresses a different antibody (antibody B). The 
results of analyzing the binding of antibody B to the protein also 
are shown. From these data, estimate 

(a) the dissociation constant (fCa) the complex between the 
protein and antibody A. 

(b) the dissociation constant for the complex between the pro¬ 
tein and antibody B. 

Comparison of the amino acid sequences of antibody A and an¬ 
tibody B reveals them to be identical except for a single amino 
acid. What does this finding suggest about the mechanism by 
which the gene encoding antibody B was generated? 



















